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Description 

FIELD OF THE INVENTION 

5 The present invention relates generally to stochastic methods for synthesizing random oligomers, with particular 

emphasis on particle-based synthesis methods. The invention also relates to the use of identification tags on the par- 
ticles to facilitate identification of the oligomer sequence synthesized. 

BACKGROUND OF THE INVENTION 

10 

The relationship between structure and activity of molecules is a fundamental issue in the study of biological sys- 
tems. Structure-activity relationships are important in understanding, for example, the function of enzymes, the ways 
in which cells communicate with each other, and cellular control and feedback systems. Certain macromolecules are 
known to interact and bind to other molecules having a very specific three-dimensional spatial and electronic distribu- 
15 tion. Any large molecule haying such specificity can be considered a receptor, whether the molecule is an enzyme 
catalyzing hydrolysis of a metabolic intermediate, a cell-surface protein mediating membrane transport of ions, a glyc- 
oprotein serving to identify a particular cell to its neighbors, an IgG-class antibody circulating in the plasma, an oligo- 
nucleotide sequence of DNA in the genome, or the like. The various molecules that receptors selectively bind are 
known as ligands. 

20 Many assays are available for measuring the binding affinity of known receptors and ligands, but the information 

that can be gained from such experiments is often limited by the number and type of available ligands. Novel ligands 
are sometimes discovered by chance or by application of new techniques for the elucidation of molecular structure, 
including x-ray crystal log raphic analysis and recombinant genetic techniques for proteins. 

Small peptides are an exemplary system for exploring the relationship between structure and function in biology. 

25 a peptide is a polymer composed of amino acid monomers. When the twenty naturally occurring amino acids are 
condensed into polymeric molecules, the resulting polymers form a wide variety of three-dimensional configurations, 
each resulting from a particular amino acid sequence and solvent condition. The number of possible pentapeptides of 
the 20 naturally occurring amino acids, for example, is 20 5 or 3.2 million different peptides. The likelihood that molecules 
of this size might be useful in receptor-binding studies is supported by epitope analysis studies showing that some 

30 antibodies recognize sequences as short as a few amino acids with high specificity. Furthermore, the average molecular 
weight of amino acids puts small peptides in the size range of many currently useful pharmaceutical products. Of 
course, larger peptides may be necessary for many purposes, and polypeptides having changes in only a small number 
of residues may also be useful for such purposes as the analysis of structure-activity relationships. 

Pharmaceutical drug discovery is one type of research that relies on studies of structure-activity relationships. In 

35 most cases, contemporary pharmaceutical research can be described as the process of discovering novel ligands with 
desirable patterns of specificity for biologically important receptors. Another example is research to discover new com- 
pounds for use in agriculture, such as pesticides and herbicides. 

Prior methods of preparing large numbers of different oligomers have been painstakingly slow when used at a 
scale sufficient to permit effective rational or random screening. For example, the -Merrifield* method Merrifield, J. Am. 

40 Chem. Soc . 85:21 49-21 54 (1 963), has been used to synthesize peptides on a solid support. In the Merrifield method, 
an amino acid is covalently bonded to a support made of an insoluble polymer. Another amino acid with an alpha 
protected group is reacted with the covalently bonded amino acid to form a dipeptide. The protective group is removed, 
and a third amino acid with an alpha protective group is added to the dipeptide. This process is continued until a peptide 
of a desired length and sequence is obtained. Using the Merrifield method, one cannot economically and practically 

45 synthesize more than a few peptide sequences in a day. 

To synthesize larger numbers of oligomer sequences, others have proposed the use of a series of reaction vessels 
for oligomer synthesis. For example, a tubular reactor system may be used to synthesize a linear oligomer on a solid 
phase support by automated sequential addition of reagents. This method still does not enable the synthesis of a 
sufficiently large number of oligomer sequences for effective economical screening. 

50 Methods of preparing a plurality of oligomer sequences are also known in which a foraminous container encloses 

a known quantity of reactive solid supports, the solid supports being larger in size than openings of the container. See 
US Patent No 4,631,211. The containers may be selectively reacted with desired materials to synthesize desired se- 
quences of product molecules. As with other methods known in the art, this method cannot practically be used to 
synthesize a sufficient variety of polypeptides for effective screening. 

55 Other techniques have also been described. One bead-based method is described in PCT patent publication No 

92/00091 . These methods include the synthesis of peptides on 96 plastic pins that fit the format of standard microtiter 
plates. See PCT patent publications 84/03564; 86/00991; and 86/06487. Unfortunately, while these techniques have 
been somewhat useful, substantial problems remain. For example, these methods continue to be limited in the diversity 
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of sequences which can be economically synthesized and screened. 

Others have developed recombinant methods for preparing collections of oligomers. See PCT patent publication 
Nos 91/17271 and 91/19818. In another important development, scientists combined the techniques of photolithogra- 
phy, chemistry, and biology to create large collections of oligomers and other compounds on the surface of a substrate. 
5 See US Patent No 5,143,854 and PCT patent publication Nos 90/15070 and 92/10092. 

In the recombinant and VLSIPS™ combinatorial methods, one can uniquely identify each oligomer in the library 
by determining the coding sequences in the recombinant organism or phage or by the location of the oligomer on the 
VLSIPS™ chip. In other methods, however, the identity of a particular oligomer may be difficult to ascertain. What is 
needed in these latter methods is an efficient and simple-to-use method for tagging each particle. Although tagging 
10 methods have been developed for large objects, see PCT patent publication Nos 90/1 4441 and 87/06383, such meth- 
ods are still needed for combinatorial libraries of oligomers. 

From the above, one can recognize that improved methods for synthesizing a diverse collection of chemical se- 
quences would be beneficial. 

15 SUMMARY OF THE INVENTION 

The invention provides the use of identifier tags to enable subsequent identification of reactions through which 
members of a library of different synthetic compounds have been synthesised in a component by component fashion 
and consequent deductive structural identification of said members. 

20 in another aspect, the invention provides a library of different synthetic compounds, which compounds are obtain- 

able by synthesis in a component by component fashion which links each compound to one or more identifier tags 
which enable subsequent identification of reactions through which said components were incorporated and consequent 
deductive structural identification of said members. 

The present invention provides a general stochastic method for synthesizing libraries of random oligomers. The 

25 random oligomers are synthesized on solid supports, or particles, but may be cleaved from these supports to provide 
a soluble library. The oligomers are composed of a sequence of monomers, the monomers being any member of the 
set of molecules that can be joined together to form an oligomer or polymer, i.e., amino acids, carbamates, sulfones, 
sulfoxides, nucleosides, carbohydrates, ureas, phosphonates, lipids, esters, combinations of the same, and the like. 
The library is then screened to isolate individual oligomers that bind to a receptor or possess some desired property. 

30 Each oligomer sequence in the library is unique, in a preferred embodiment. In another preferred embodiment, the 
solid supports are nonporous beads. The solid supports may be composed of a single particle, or two or more linked 
particles. 

The invention involves the use of an identifier tag to identify the sequence of monomers in the oligomer. The 
identifier tag, which may be attached directly to the oligomer with or without an accompanying particle, to a linker 

35 attached to the oligomer, to the solid support upon which the oligomer is synthesized, or to a second particle attached 
to the oligomer-carrying particle, may be any recognizable feature that in some way carries the required information, 
and that is decipherable at the level of one or a few solid supports. The solid supports may be joined to the oligomers 
and the identifier tag by means of one or more linker molecules. 

In a preferred embodiment, the identifier tag will be an oligonucleotide, preferably composed of pyrimidines or 

40 pyrimidines and purine analogs or any type of nucleoside that will not degrade under the coupling conditions used to 
assemble the oligomer library. The oligonucleotide identifier tag may contain a 5* and a 3* amplification site, to allow 
amplification ofthe tag by, for example, the polymerase chain reaction (see US Patent Nos 4,683,202; and 4,965,188). 
A DNA sequencing primer site, which may be specific for each step of the oligomer synthesis, may also be included 
in the oligonucleotide tag in addition to the amplification primer sites. The tag may be designed to include, in the 

45 oligonucleotide sequence, information allowing identification of the monomer associated with the addition of the par- 
ticular tag. The oligonucleotide tag will be about 50 to 100 nucleotides in length, in a preferred embodiment. 

In another preferred embodiment, the identifier tag may be composed of a set of light-addressable compounds, 
such as fluorescent or phosphorescent compounds that can be photob leached, which compounds are incorporated 
into the beads or particles on which the oligomers of the oligomer library are synthesized. Such compounds are widely 

50 known in the art. 

BRIEF DESCRIPTION OF THE FIGURES 

Fig. 1 is a schematic representation of combinatorial oligomer synthesis on particles. 
55 Fig. 2 is a schematic representation of concurrent combinatorial oligomer synthesis and particle tagging. 

Fig. 3 describes one method of bead f unctionalization, the compatible chemistries for peptide synthesis and round 
by round attachment of oligonucleotide identifier tags, including synthesis of amino-f unctionalized beads, the structure 
of protected 5' maleimidyl oligonucleotides, amino acid coupling and introduction of a thiol "handle," step-specific oli- 
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gonucleotide attachment to a bead, subsequent amino acid coupling(s) and oligonucleotide attachment(s), and peptide 
and oligonucleotide deprotection. 

Fig. 4 is a schematic representation of one example of an oligonucleotide tag. 

Fig. 5 illustrates nucleoside phosphoramidites derivatized with photolabile protecting groups for parallel peptide/ 
5 oligonucleotide synthesis. 

Fig. 6 illustrates 5'-DMT-3'-(0-allyl N.N'-diisopropyl phosphoramidite) nucleoside derivatives for parallel peptide/ 
oligonucleotide synthesis. 

Fig. 7 illustrates the preparation of a bifunctional bead material for parallel synthesis of peptides and oligonucle- 
otides. 

10 Fig. 8 illustrates the parallel assembly of oligonucleotide-tagged peptides on beads. 

Fig. 9 shows a schematic representation of the experiment described in Example 5, in which two populations of 
oligomers on beads are prepared, tagged, mixed, sorted, and identified by the method of the present invention. 

Fig. 10 shows resolution of the two populations of beads by FACS in the experiment described in Example 5. 
Values along the horizontal axis indicate relative fluorescence (log scale), values along the vertical axis indicate relative 
15 numbers of beads. Non-fluorescently labeled beads are represented by peak A. Flourescently labeled beads are rep- 
resented by peak B. The ratio of the larger peak to the smaller peak is 15:1 . 

Fig. 11 shows pictures of ethidium bromide stained, UV irradiated agarose gels of PCR products obtained by 
amplification after FACS of two bead populations and amplification of the tags on the sorted beads, with controls, as 
described in Example 5. Gel A shows the results with sorted fluorescent beads: lane 1 - 2.4x10 s copies (100 bead 
20 equivalents) of 95 mer tag; lanes 2-7 - PCR product from single fluorescent beads; lanes 8-1 0 - PCR product from ten 
fluorescent beads; lanes 11-1 3 - PCR product from one hundred fluorescent beads; and lane 14-1 .2x1 0 6 copies (100 
bead equivalents) of 110 mer tag. Gel B shows the result with sorted non-fluorescent beads: lane 1 - 1 .2x1 0 6 copies 
of 110 mer tag; lanes 2-7 - PCR product from single non-fluorescent beads; lanes 8-10 - PCR product from ten non- 
fluorescent beads; lanes 11-13 - PCR product from one hundred non-fluorescent beads; and lane 14 - 2.4x1 0 6 copies 
25 of 95 mer tag. Gel C shows the results with the control reactions: lanes 1,12- DNA size standards; lanes 2,3 - no tag 
control reactions; lanes 4,5 - 1 bead equivalent of soluble 95 mer tag lanes 6, 7 - 10 bead equivalents of soluble 95 
mer tag; lanes 8, 9 - 1 bead equivalent of soluble 110 mer tag; and lanes 10, 11 - 10 bead equivalents of soluble 110 
mer tag. 

30 DESCRIPTION OF THE SPECIFIC EMBODIMENTS 

The present invention provides novel methods for producing large synthetic oligomer libraries. In a preferred em- 
bodiment of the present invention, each member of such a library is uniquely labeled in a manner that specifies the 
identity of the sequence of the oligomer corresponding to that member. Methods for screening such libraries and rea- 
35 gents useful for producing the libraries are also provided. 

Glossary 

The following terms are intended to have the following general meanings as they are used herein: 

40 Complementary or substantially complementary: These terms refer to base pairing between nucleotides or nucleic 

acids, such as, for instance, between the two strands of a double stranded DNA molecule or between an oligonucleotide 
primer and a primer binding site on a single stranded nucleic acid to be sequenced or amplified. " Complementary " 
nucleotides are, generally, A and T (or A and U), and C and G, as is well known to those of skill in the art. Two single 
stranded RNA or DNA molecules are said to be "substantially complementary" when the nucleotides of one strand, 

45 optimally aligned, pair with at least about 80% of the nucleotides of the other strand. 

Alternatively, substantial complementarity exists when an RNA or DNA strand will hybridize under selective hy- 
bridization conditions to a complementary nucleic acid. Typically, selective hybridization will occur when there is at 
least about 55% complementarity over a stretch of at least 14 to 25 nucleotides, but more selective hybridization will 
occur as complementarity increases to 65%, 75%, 90%, and 100%. See Kanehisa, Nucleic Acids Res. 12 :203 (1984). 

so Stringent hybridization conditions will typically include salt concentrations of less than about 1 M, such as less 

than 500 mM, and will often include salt concentrations of less than 200 mM. The hybridization temperature for oli- 
gomers will typically be greater than 22°C, such as greater than about 30°C, and will often be in excess of about 37°C. 
Longer fragments may require higher hybridization temperatures for specific hybridization. As other factors may dra- 
matically affect the stringency of hybridization (such factors include base composition, length of the complementary 

55 strands, presence of organic solvents, and extent of base mismatching), the combination of factors is more important 
than the absolute measure of any one factor alone. 

Epitope : The portion of an antigen molecule delineated by the area of interaction with the subclass of receptors 
known as antibodies is an "epitope." 
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Identifier tag : An "identifier tag" is a physical attribute that provides a means whereby one can identify which 
monomer reactions an individual solid support has experienced in the synthesis of an oligomer. The identifier tag also 
records the step in the synthesis series in which the solid support visited that monomer reaction. The identifier tag may 
be any recognizable feature, including for example: a microscopically distinguishable shape, size, color, optical density, 

5 etc.; a differential absorbance or emission of light; chemically reactivity; magnetic or electronic encoded information; 
or any other distinctive mark with the required information, and decipherable at the level of one (or a few) solid support 
(s). A preferred example of such an identifier tag is an oligonucleotide sequence. An "identifier tag" can be coupled 
directly to the oligomer synthesized, whether or not a solid support is used in the synthesis. In this latter embodiment, 
the identifier tag serves as the "support" for oligomer synthesis. 

10 Liqand : A "ligand" is a molecule that is recognized by a particular receptor. The agent bound by or reacting with a 

receptor is called a "ligand", a term which is definitional^ meaningful only in terms of its counterpart receptor. The term 
"ligand" does not imply any particular molecular size or other structural or compositional feature other than that the 
substance in question is capable of binding or otherwise interacting with the receptor. Also, a "ligand" may serve either 
as the natural ligand to which the receptor binds, or as a functional analogue that may act as an agonist or antagonist. 

is Ligands that can be investigated by this invention include, but are not restricted to, agonists and antagonists for cell 
membrane receptors, toxins and venoms, viral epitopes, hormones, sugars, cofactors, peptides, enzyme substrates, 
drugs (e.g., opiates, steroids, etc), and proteins. 

Monomer : A "monomer" is any member of the set of molecules which can be joined together to form an oligomer 
or polymer. The set of monomers useful in the present invention includes, but is not restricted to, for the example of 

20 peptide synthesis, the set of L-amino acids, D-amino acids, or synthetic amino acids. As used herein, "monomer" refers 
to any member of a basis set for synthesis of an oligomer. For example, dimers of L-amino acids form a basis set of 
400 "monomers" for synthesis of polypeptides. Different basis sets of monomers may be used at successive steps in 
the synthesis of a polymer. 

Oligomer or Polymer : The "oligomer" or "polymer" sequences of the present invention are formed from the chemical 

25 or enzymatic addition of monomer subunits. Such oligomers include, for example, both linear, cyclic, and branched 
polymers of nucleic acids, polysaccharides, phospholipids, and peptides having either alpha-, beta-, or omega-amino 
acids, heteropolymers, polyurethanes, polyesters, polycarbonates, polyureas, polyamides, polyethyleneimines, pol- 
yarylene sulfides, polysiloxanes, polyimides, polyacetates, or other polymers, as will be readily apparent to one skilled 
in the art upon review of this disclosure. 

30 Peptide : A "peptide" is an oligomer in which the monomers are alpha amino acids joined together through amide 

bonds. Alternatively, a "peptide" can be referred to as a "polypeptide." In the context of this specification, one should 
appreciate that the amino acids may be the L-optical isomer or the D-optical isomer. Peptides are more than two amino 
acid monomers long, but more often are more than 10 amino acid monomers long and can be even longer than 20 
amino acids, although peptides longer than 20 amino acids are more likely to be called "polypeptides." Standard single 

35 letter abbreviations for amino acids are used (e.g., P for proline). 

Oligonucleotides : An "oligonucleotide" is a single-stranded DNAor RNA molecule, typically prepared by synthetic 
means. Those oligonucleotides employed in the present invention will usually be 50 to 150 nucleotides in length, pref- 
erably from 80 to 120 nucleotides, although oligonucleotides of different length may be appropriate in some circum- 
stances. For instance, in one embodiment of the invention, the oligonucleotide tag and the polymer identified by that 

40 tag are synthesized in parallel. In this embodiment, the oligonucleotide tag can be built nucleotide-by-nucleotide in 
coordination with the monomer-by-monomer addition steps used to synthesize the oligomer. In addition, very short, i. 
e., 2 to 10 nucleotides, oligonucleotides may be used to extend an existing oligonucleotide tag to identify a monomer 
coupling step. Suitable oligonucleotides may be prepared by the phosphoramidite method described by Beaucage and 
Carruthers, Tetr. Lett . 22:1859-1862 (1 981 ), or by the triester method, according to Matteucci et aj., J. Am. Chem. Soc. 

45 103 :3185 (1981), or by other methods such as by using commercial automated oligonucleotide synthesizers. 

Operably linked: A nucleic acid is "operably linked" when placed into a functional relationship with another nucleic 
acid sequence. For instance, a promoter or enhancer is "operably linked" to a coding sequence if the promoter causes 
the transcription of the sequence. Generally, operably linked means that the DNA sequences being linked are contig- 
uous and, where necessary to join two protein coding regions, contiguous and in reading frame. 

50 Receptor : A "receptor" is a molecule that has an affinity for a given ligand. Receptors may be naturally-occurring 

or manmade molecules. Also, receptors can be employed in their unaltered natural or isolated state or as aggregates 
with other species. Receptors may be attached, covalently or noncovalently, to a binding member, either directly or via 
a specific binding substance. Examples of receptors that can be employed in the method of the present invention 
include, but are not restricted to, antibodies, cell membrane receptors, monoclonal antibodies, antisera reactive with 

55 specific antigenic determinants (such as on viruses, cells, or other materials), polynucleotides, nucleic acids, lectins, 
polysaccharides, cells, cellular membranes, and organelles. Receptors are sometimes referred to in the art as "anti- 
ligands." As the term "receptor" is used herein, no difference in meaning is intended. A "ligand-receptor pair" is formed 
when two macromolecules have combined through molecular recognition to form a complex. 
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Other examples of receptors that can be investigated by this invention include, but are not restricted to: 
Microorganism receptors: Determination of ligands that bind to microorganism receptors, such as specific transport 

proteins or enzymes essential to survival of microorganisms, is useful in discovering new classes or types of antibiotics. 

Of particular value would be antibiotics against opportunistic fungi, protozoa, and those bacteria resistant to the anti- 
5 biotics in current use. 

Enzymes : For instance, the binding site of any enzyme, such as the enzymes responsible for cleaving neurotrans- 
mitters, is a receptor. Determination of ligands that bind to certain enzymes, and thus modulate the action of the en- 
zymes that cleave the different neurotransmitters, is useful in the development of drugs that can be used in the treatment 
of disorders of neurotransmission. 
10 Antibodies : For instance, the invention may be useful in investigating the ligand-binding site on an antibody mol- 

ecule that combines with the epitope of an antigen of interest. Determining a sequence that mimics an antigenic epitope 
may lead to the development of vaccines or lead to the development of related diagnostic agents or compounds useful 
in therapeutic treatments such as for autoimmune diseases (e.g., by blocking the binding of the "self antibodies). 

Nucleic Acids: The invention may be useful in investigating sequences of nucleic acids acting as binding sites for 
is cellular proteins ( "trans -acting factors"). Such sequences may include, e.g., enhancers or promoter sequences. 

Catalytic Polypeptides: Polymers, preferably polypeptides, which are capable of promoting a chemical reaction 
involving the conversion of one or more reactants to one or more products are "catalytic polypeptides." Such polypep- 
tides generally include a binding site specific for at least one reactant or reaction intermediate and an active functionality 
proximate to the binding site, which functionality is capable of chemically modifying the bound reactant. Catalytic 
20 polypeptides are described in, Lerner et al., Science 252: 659 (1991). 

Hormone receptors : For instance, "hormone receptors" include the receptors for insulin and growth hormone. 
Determination of the ligands which bind with high affinity to a hormone receptor is useful in the development of, for 
example, an oral replacement of the daily injections which diabetics must take to relieve the symptoms of diabetes, 
and in the other case, a replacement for human growth hormone, which can only be obtained from cadavers or by 
25 recombinant DNA technology. Other examples include the vasoconstrictive hormone receptors; determination of lig- 
ands that bind to those receptors may lead to the development of drugs to control blood pressure. 

Opiate receptors: Determination of ligands that bind to the opiate receptors in the brain is useful in the development 
of less-addictive replacements for morphine and related drugs. 

Substrate or Solid Support : A "substrate or solid support" is a material having a rigid or semi-rigid surface. Such 
30 materials will preferably take the form of small beads, pellets, disks, or other convenient forms, although other forms 
may be used. In some embodiments, at least one surface of the substrate will be substantially flat. A roughly spherical 
shape is preferred. 

Synthetic : A compound is "synthetic" when produced by jn vitro chemical or enzymatic synthesis. The synthetic 
libraries of the present invention may be contrasted with those in viral or plasmid vectors, for instance, which may be 
35 propagated in bacterial, yeast, or other living hosts. 



I. Method for Producing Large Synthetic Oligomer Libraries 



A general method of random oligomer synthesis can be used to produce the enormous numbers of compounds 
available with recombinant systems and to utilize the monomer set diversity available with chemical synthesis methods. 
One can readily produce up to 10 1 2 different oligomers, a dramatic improvement over previous methods. The invention 
provides a facile means of oligomer identification. 

The general method comprises producing a large, highly diverse collection or library, each member of such a library 
comprising a single oligomer sequence (e.g., a peptide). The sequence may be soluble or may be bound to a solid 
support. When bound to a solid support, the oligomer is usually attached by means of a linker. The linker, prior to 
attachment, has an appropriate functional group at each end, one group appropriate for attachment to the support and 
the other group appropriate for attachment to the oligomer. Such a collection may contain, for example, all combinations 
of n monomers assembled into X length oligomers yielding, n x different compounds. The collection may also contain 
oligomers having different monomer units at, for example, only one or a small number of positions, while having an 
identical sequence at all other positions. The general method typically involves synthesizing the oligomers in a random 
combinatorial ('stochastic") fashion by chemical and/or enzymatic assembly of monomer units. 

A synthetic oligomer library may be produced by synthesizing on each of a plurality of solid supports a single 
oligomer sequence, the oligomer sequence being different for different solid supports. The oligomer sequence is syn- 
thesized in a process comprising the steps of: (a) apportioning the supports in a stochastic manner among a plurality 
of reaction vessels; (b) exposing the supports in each reaction vessel to a first monomer; (c) pooling the supports; (d) 
apportioning the supports in a stochastic manner among the plurality of reaction vessels; (e) exposing the supports in 
each reaction vessel to a second monomer; and (f) repeating steps (a) through (e) from at least one to twenty times. 
Typically, substantially equal numbers of solid supports will be apportioned to each reaction vessel. The monomers 
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may be chosen from the set of amino acids, and the resulting oligomer is a peptide. 

As a specific example of the method, one may consider the synthesis of peptides three residues in length, assem- 
bled from a monomer set of three different monomers: A, B, and C. The first monomer is coupled to three different 
altquots of beads, each different monomer in a different aliquot, and the beads from all the reactions are then pooled 

s (see Fig. 1 ). The pool now contains approximately equal numbers of three different types solid supports, with each 
type characterized by the monomer in the first residue position The pool is mixed and redistributed to the separate 
monomer reaction tubes or vessels containing A, B, or C as the monomer. The second residue is coupled. 

Following this reaction, each tube now has beads with three different monomers in position one and the monomer 
contained in each particular second reaction tube in position 2. All reactions are pooled again, producing a mixture of 

io beads each bearing one of the nine possible dinners. The pool is again distributed among the three reaction vessels, 
coupled, and pooled. This process of sequential synthesis and mixing yields beads that have passed through all the 
possible reaction pathways, and the collection of beads displays all trimers of three amino acids (& = 27). Thus, a 
complete set of the trimers of A, B, and C is constructed. As can be readily appreciated, the use of a sufficiently large 
number of synthesis beads helps to ensure that the set completely represents the various combinations of monomers 

15 employed in this random, combinatorial synthesis scheme. 

This method of assembling oligomers from many types of monomers requires using the appropriate coupling chem- 
istry for a given set of monomer units or building blocks. Any set of building blocks that can be attached to one another 
in a step-by-step fashion can serve as the monomer set. The attachment may be mediated by chemical, enzymatic, 
or other means, or by a combination of any of these means. The resulting oligomers can be linear, cyclic, branched, 

20 or assume various other conformations as will be apparent to those skilled in the art. Techn iques for solid state synthesis 
of polypeptides are described, for example, in Merrifield, supra . Peptide coupling chemistry is also described in The 
Peptides, Vol. 1 (eds. Gross, E., and J. Meienhofer, Academic Press, Orlando (1979)). 

To synthesize the oligomers, a collection of a large number of the solid supports is apportioned among a number 
of reaction vessels. In each reaction, a different monomer is coupled to the growing oligomer chain. The monomers 

25 may be of any type that can be appropriately activated for chemical coupling or accepted for enzymatic coupling. 
Because the reactions may be contained in separate reaction vessels, even monomers with different coupling chem- 
istries can be used to assemble the oligomers (see The Peptides, supra ). The coupling time for some of the monomer 
sets may be long. For this reason, the preferred arrangement is one in which the monomer reactions are carried out 
in parallel. After each coupling step, the solid supports on which are synthesized the oligomers of the library are pooled 

30 and mixed prior to re-allocation to the individual vessels for the next coupling step. This shuffling process produces 
solid supports with many oligomer sequence combinations. If each synthesis step has high coupling efficiency, then 
substantially all the oligomers on a single solid support have the same sequence. That sequence is determined by the 
synthesis pathway (type and sequence of monomer reactions) for any given solid support at the end of the synthesis. 
The maximum length of the oligomers is typically less than about 20, usually from 3 to 8 residues in length, but in some 

35 cases a length of 10 to 12 residues is preferred. Protective groups known to those skilled in the art may be used to 
prevent spurious coupling (see The Peptides, Vol. 3 (eds. Gross, E., and J. Meienhofer, Academic Press, Orlando 
(1981)). 

Modifications of this completely random approach are also possible. For example, the monomer set may be ex- 
panded or contracted from step to step; or the monomer set could be changed completely for the next step (e.g., amino 

40 acids in one step, nucleosides in another step, carbohydrates in another step), if the coupling chemistry were available 
(see Gait, Oligonucleotide Synthesis: A Practical Approach , IRL Press, Oxford (1984); Friesen and Danishefsky, J. 
Amer. Chem. Soc. 111 :6656 (1989); and Paulsen, Angew. Chem. Int. Ed. Engl. 25:212 (1986). Monomer units for 
peptide synthesis, for example, may include single amino acids or larger peptide units, or both. One variation is to form 
several pools of various sequences on solid supports to be distributed among different monomer sets at certain steps 

45 of the synthesis. By this approach, one can also build oligomers of different lengths with either related or unrelated 
sequences, and one can fix certain monomer residues at some positions while varying the other residues, to construct 
oligomer frameworks wherein certain residues or regions are altered to provide diversity. 

The chemical or enzymatic synthesis of the oligomer libraries of the present invention typically takes place on solid 
supports. The term "solid support" as used herein embraces a particle with appropriate sites for oligomer synthesis 

50 and, in some embodiments, tag attachment and/or synthesis. There are various solid supports useful in preparation 
of the synthetic oligomer libraries of the present invention. Solid supports are commonly used for solid phase synthesis 
of, for example, peptides and nucleic acids and other oligomers as enumerated above, and thus are well known to 
those skilled in the art. 

With enough solid supports and efficient coupling, one can generate complete sets of certain oligomers, if desired. 
55 in general, the solid support size is in the range of 1 nm to 100 (am, but a more massive solid support of up to 1 mm 
in size may be used. The appropriate size of the solid support depends on (1) the number of oligomer synthesis sites 
and identifier tag attachment sites desired; (2) the number of different compounds to be synthesized (and the number 
of solid supports bearing each oligomer that are needed for screening); and (3) the effect of the size of the solid supports 
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on the specific screening strategies [eg., fluorescence-activated cell sorters (FACS)] to be used. 

As a specific example, solid supports of 1 jam in diameter may be used. If each reaction contains approximately 
0.2 ml of solid supports, and the oligomers are synthesized from a set of 50 monomers (50 parallel reactions), then a 
total of 10 mL of solid supports, or approximately 10 13 solid supports, would be required. If one wishes to make hex- 
amers with these 50 monomers, then there are over 1 .5 x 10 10 possible sequences, and each specific sequence would 
be represented on about 10 3 solid supports. An estimated capacity of each bead, based on the capacity of commonly 
used peptide synthesizing resins, is about 0.1 pg of peptide per bead. By this estimation, then, each solid support 
would have about 10 8 oligomer chains. 

To improve washing efficiencies, solid supports less porous than typical peptide synthesis resins are preferable. 
These supports will have a lower density of growing chains, but even with a decrease in capacity of several orders of 
magnitude, sufficient oligomer densities can be produced for efficient screening. With the less porous supports, a 
greater proportion of the oligomers will be accessible for binding to the receptor during the screening process. Also, 
the less porous supports will reduce the carryover of tags from one reaction to the next, thus improving the accuracy 
of reading the dominant (correct) tags. 

Such solid supports may be of any shape, although they will preferably be roughly spherical. The supports need 
not necessarily be homogenous in size, shape, or composition; although the supports usually and preferably will be 
uniform. In some embodiments, supports that are very uniform in size may be particularly preferred. In another em- 
bodiment, however, two or more distinctly different populations of solid supports may be used for certain purposes. 

Solid supports may consist of many materials, limited primarily by capacity for derivatization to attach any of a 
number of chemically reactive groups and compatibility with the chemistry of oligomer synthesis and tag attachment. 
Suitable support materials include glass, latex, heavily cross-linked polystyrene or similar polymers, gold or other col- 
loidal metal particles, and other materials known to those skilled in the art. Except as otherwise noted, the chemically 
reactive groups with which such solid supports may be derivatized are those commonly used for solid state synthesis 
of the respective oligomer and thus will be well known to those skilled in the art. The solid supports used in the present 
invention do not include living cells, viruses, or cloning vectors such as phage vectors or plasmids. 

II. Method for Producing Tagged Synthetic Oligomer Libraries 

The oligomers comprising the library also are attached to an identifier tag that can be easily decoded to report the 
sequence of each oligomer. The identifier tags may be attached either to the oligomer or to the solid support to which 
the oligomer is attached. The attachment is preferably by means of a linker that, prior to attachment, has an appropriate 
functional group at each end, one group appropriate for attachment to the support and the other group appropriate for 
attachment to the identifier tag. Alternatively, the identifier tag may be attached to a monomer incorporated into the 
oligomer or attached directly to the same linker that binds the oligomer to the solid support. In the latter embodiment, 
the linker has, prior to attachment, a third functional group appropriate for the attachment of the identifier tag. 

A synthetic oligomer library that incorporates identifier tags is produced by synthesizing on each of a plurality of 
solid supports a single oligomer sequence and one or more identifier tags identifying the oligomer sequence. The 
tagged synthetic oligomer library is synthesized in a process comprising the steps of: (a) apportioning the supports 
among a plurality of reaction vessels; (b) exposing the supports in each reaction vessel to a first oligomer monomer 
and to a first identifier tag monomer; (c) pooling the supports; (d) apportioning the supports among a plurality of reaction 
vessels; and (e) exposing the supports to a second oligomer monomer and to a second identifier tag monomer. As 
noted above, one can also practice the invention in a mode in which there is no solid support; in this mode, the tag is 
attached directly to the oligomer being synthesized. The steps of either process typically will be repeated one or more 
times, but usually, will be repeated less than 20 times. 

The solid supports can be exposed to (or coupled with) an oligomer monomer and an identifier tag at the same 
time, or sequentially. In either event, the supports are then pooled and exposed to the second oligomer monomer and 
second identifier tag. As before, these steps are repeated, typically from one to about 20 times. The invention is de- 
scribed herein primarily with regard to the preparation of molecules containing sequences of amino acids, but the 
invention can readily be applied to the preparation of other oligomers and to any set of compounds that can be syn- 
thesized in a component-by-component fashion, as can be appreciated by those skilled in the art. 

In another embodiment, the same solid support is used for synthesizing all members of the library but the members 
are cleaved from the support prior to screening. In this embodiment, synthesis of tagged oligomers may be accom- 
plished utilizing very large scale immobilized polymer synthesis (VLSI PS™) techniques. See U.S. Patent No. 5, 1 43,854 
and PCT patent publication No. 92/10092. An array of oligonucleotides is synthesized on the VLSI PS™ chip, each 
oligonucleotide linked to the chip by a cleavable group such as a disulfide. In one embodiment, each oligonucleotide 
tag has an amine group at the free end and only contains pyrimidine or pyrimidine and purine analog bases. In addition 
each oligonucleotide contains binding sites for amplification, i.e., PCR primer sites and optionally a sequencing primer 
site. A short section of each oligonucleotide uniquely codes the monomer sequence of the oligomer to be tagged. Then, 
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e.g., peptides are synthesized, optionally from the free terminal amine groups on each oligonucleotide, so that each 
peptide is linked to a tag. The whole collection of oligonucleotide-peptide may be released from the chip to create a 
soluble tagged oligomer library. 

More preferably, however, the oligomer library is constructed on beads or particles. One method of bead function- 

5 alization, with compatible chemistries for peptide synthesis and round by round attachment of oligonucleotide identifier 
tags, is shown in Figs. 3.1-3.6. Glass beads are derivatized using aminopropyltriethoxysilaneand a beta -alanine spacer 
group is coupled using activated ester methodology. The oligonucleotide tags may optionally incorporate a biotin group 
to facilitate purification, hybridization, amplification, or detection (see Pierce Immuno Technology Catalog and Hand- 
book , 1991). Commercially available Fmoc protected amino acids and standard BOP coupling chemistry is employed 

10 for peptide synthesis (see The Peptides, supra) . Protected polypyrimidine (e.g., cytidine protected as fv^-Bz-C) and/ 
or purine analog containing oligonucleotides resistant to the coupling and deprotection reagents used in peptide syn- 
thesis are attached using maleimide chemistry to unmasked thiol groups incorporated into growing peptide chains at 
low frequency (i.e., 0.1%) as cysteine residues with masked thiol groups (which masks may be selectively removed 
prior to tagging). In other embodiments of the invention, one may not need to use protected nucleosides or oligonu- 

15 cleotides. 

However, to maintain the integrity of an oligonucleotide tag during peptide synthesis, one may need to use different 
combinations of protecting groups and/or synthetic nucleotides to avoid degradation of the tag or the oligomer synthe- 
sized. In general, polypyrimidine oligonucleotide tags are relatively stable under typical peptide synthesis conditions, 
as opposed to oligonucleotide tags that contain natural purine nucleotides, but a polypyrimidine nucleotide tag may 

20 be somewhat refractory to amplification by PCR. One may need to incorporate purine bases, or analogs tested for 
ability to withstand peptide coupling (and deprotection) conditions, into the tag to acheive a desired efficiency of am- 
plification. For purposes of the present invention, the tag optionally may contain from 10 to 90%, more preferably 35 
to 50%, and most preferably 33 to 35%, purine or purine analog nucleotides. The oligonucleotides optionally may 
contain phosphate protecting groups (e.g., O-methyl phosphates) with greater base stability than the standard beta- 

25 cyanoethyl group, which may be susceptible to piperidine cleavage. In such cases, peptide and oligonucleotide de- 
protection can be effected by sequential treatment with thiophenol, trifluoroacetic acid, and ethanolic ethylenediamine 
at 55°C. In another embodiment, photolabile alpha-amino protecting groups are used in conjunction with base-labile 
side chain protecting groups for the amino acids, and standard beta-cyanoethyl protecting groups are used for the 
oligonucleotide tags. 

30 in another embodiment, oligonucleotides containing both modified or synthetic purines and pyrimidines may be 

synthesized in parallel with peptides using conventional Fmoc^Bu protected amino acids. In this method, one can also 
use O-allyl and N-allyloxycarbonyl groups to provide protection for phosphate oxygens and the exocydic amines of the 
nucleoside bases, respectively (see Hayakawa et at., J. Amer. Chem. Soc. 112 : 1691-1696 (1990)). By employing the 
mild oxidant tBuOOH for oxidation at the phosphorous, one can minimize oxidation of the amino acids methionine, 

35 tryptophan, and histidine (see Hayakawa et aj. , Tetr. Lett. 27:4191-4194 (1986)). Use of pyridinium hydrochloride/ 
imidazole as a phosphporamidite activator leads to selective 5'-0-phosphitylation at the expense of low levels of spu- 
rious reaction at nitrogen on the peptide or oligonucleotide (see Gryaznov and Letsinqer, Nucleic Acids Research 20 : 
1879-1882 (1992)). The lability of purine nucleotides to strong acid (e.g., TFA) is avoided by use of phosphoramidites 
of the purine nucleoside analogs 7-deaza-2'-deoxyadenosine and 7-deaza-2'-deoxyguanosine (see Barr et al., Bio- 

40 Techniques 4:428-432 (1986), and Scheit, Nucleotide Analogs: Synthesis and Biological Function pp. 64-65 (John 
Wiley and Sons, New York)). 

The fully assembled peptide and oligonucleotide chains may be deprotected by first treating the products with 30% 
piperidine in DMF to remove amino-terminal Fmoc groups. Then, the allylic protecting groups are removed using THF 
containing tris (dibenzylideneacetone) dipalladium-chloroform complex, triphenylphosphine, and n-butylamine/formic 

45 acid, followed by a THF wash, an aqueous sodium N,N-diethyldithiocarbamate wash, and a water wash. Finally, the 
acid-labile amino acid protecting groups are removed by treatment with 95:5 TFA/water. 

Other methods also provide effective orthogonal protection during the parallel assembly of oligonucleotides and 
peptides. These methods include use of acid-labile protecting groups on phosphates and exocyclic amines of deoxy- 
cytidine, 7-deaza-deoxyadenosine, and 7-deaza-deoxyguanosine sufficiently robust to resist the 3% trichloroacetic 

50 acid used in 5'-0-detritylation; use of photochemically removable protecting groups on these residues; and combinations 
of such acid and photolabile- groups (for photolabile protecting groups for phosphate, see Baldwin et al.. Tetr. Lett. 46 : 
6879-6884 (1990); see also figure 5). 

III. Identifying the Sequence of any Oligomer 

55 

The present invention provides a method for identifying the composition and sequence of any of the oligomers in 
the library. By tracking the synthesis pathway that each oligomer has taken, one can deduce the sequence of monomers 
of any oligomer. The method involves linking an identifier tag to the oligomer that indicates the monomer reactions and 
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corresponding step numbers that define each oligomer in the library. After a series of synthesis steps (and concurrent 
identifier tag additions), one "reads" the identifier tag(s) associated with an oligomer to determine the sequence of that 
oligomer. 

For example, one might attach microscopically recognizable, alphanumeric tags to each bead (see Fig. 2): "Al" 
means that the bead participated in the A-monomer reaction at step 1 , "C2" means that the bead participated in the 
C-monomer reaction at step 2, and "B3" means B-monomer was added in step 3, and so on. At the end of the 3-step 
synthesis, the bead would have three tags attached, e.g., Al, C2, and B3, indicating that the sequence of the peptides 
on the bead is ACB. This scheme requires a number of distinct identifier tags equal to at most the product of the number 
of different monomers and the number of synthesis steps (nine in this example). The number of identifier tags is reduced 
if the symbols are attached to one another in the order of the steps: A, A-C, A-C-B. In this case only as many identifier 
tags are needed as monomers. One builds the identifier tag in much the same way as the peptides, so as to preserve 
a record of what was monomer was added, and in which addition step. 

The identifier tags therefore identify each monomer reaction that an individual library member or solid support has 
experienced and record the step in the synthesis series in which each monomer is added. The tags may be attached 
immediately before, during, or after the monomer addition reaction, as convenient and compatible with the type of 
identifier tag, modes of attachment, and chemistry of oligomer synthesis. The identifier tag is added when the solid 
supports that have undergone a specific monomer addition step are physically together and so can be tagged as a 
group, i.e., prior to the next pooling step. 

In some cases, of course, when only a small number of monomer units of an oligomer are varied, one may need 
to identify only those monomers which vary among the oligomers, as when one wants to vary only a few amino acids 
in a peptide. For instance, one might want to change only 3 to 6 amino acids in peptides 6 to 12 amino acids long, or 
one might want to change as few as 5 amino acids in polypeptides up to 50 amino acids long. One may uniquely identify 
the sequence of each peptide by providing for each solid support an identifier tag specifying only the amino acids varied 
in each sequence, as will be readily appreciated by those skilled in the art. In such cases, all solid supports may remain 
in the same reaction vessel for the addition of common monomer units and apportioned among different reaction 
vessels for the addition of distinguishing monomer units. 

The identifier tag can be associated with the oligomer through a variety of mechanisms, either directly, through a 
linking molecule, or through a solid support upon which the oligomer is synthesized. In the latter mode, one could also 
attach the tag to another solid support that, in turn, is bound to the solid support upon which the oligomer is synthesized. 

IV. Types of Identifier Tags 

The identifier tag may be any recognizable feature that is, for example: microscopically distinguishable in shape, 
size, color, optical density, etc.; differently absorbing or emitting of light; chemically reactive; magnetically or electron- 
ically encoded; or in some other way distinctively marked with the required information, and decipherable at the level 
of one (or few) solid supports. In one embodiment, each bead or other solid support in the library incorporates a variety 
of fluorophores, or other light addressable type of molecules, the spectral properties of which can be changed and 
therefore used to store information. In one such mode, a bead incorporates a variety of fluorophors, each of which can 
be selectively photobleached, and so rendered incapable of fluorescence or of diminished fluoresence. During each 
coupling step, the bead is irradiated (or not) to photobleach (or not) one or more particular types of fluorophors, thus 
recording the monomer identity in the oligomer synthesized. See Science 255: 1213 (6 Mar. 1992). 

One can construct microscopically identifiable tags as small beads of recognizably different sizes, shapes, or 
colors, or labeled with barcodes. The tags can be "machine readable" luminescent or radioactive labels. The identifier 
tag can also be an encodable molecular structure. The information may be encoded in the size (e.g. length of a polymer) 
or the composition of the molecule. The best example of this latter type of tag is a nucleic acid sequence, i.e., RNA or 
DNA assembled from natural or modified bases. 

Synthetic oligodeoxy ribonucleotides are especially preferred information-bearing identifier tags. Oligonucleotides 
are a natural, high density information storage medium. The identity of monomer type and the step of addition is easily 
encoded in a short oligonucleotide sequence and attached, for example, to each peptide synthesis bead. When a 
single bead is isolated by screening, e.g., for receptor binding, the attached oligonucleotides can be amplified by 
methods such as PCR (see PCR Protocols: A Guide to Methods and Applications. (Innis, M, Gelfand, D., Sninsky, J. 
and White, T, Academic Press, San Diego 1 990) ), or by other nucleic acid amplification techniques, such as the ligase 
chain reaction and the self-sustained sequence replication system. The amplified product can be easily sequenced or 
otherwise identified to decode the identity of the peptide on the bead. For this purpose, one can use any of a variety 
of sequencing methods, including sequencing by sequence-specific probe hybridization. 

Alternatively, the information may be encoded in the length rather than, or in addition to, the sequence of the 
oligonucleotide. If only oligonucleotide length is utilized to represent each specific monomer addition to the oligomer, 
then the identity of the oligomer can be decoded by amplifying the oligonucleotide, as described above, and identifying 
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the labels through any of a variety of size-separation techniques, including polyacrylamide gel electrophoresis or cap- 
illary electrophoresis. 

There are several ways that oligonucleotides can be used as identifier tags. The oligonucleotides can be assem- 
bledbase-by-base before, during, or after the corresponding oligomer (e.g., peptide) synthesis step. In one case of 
5 base-by-base synthesis, the tag for each step is a single nucleotide, or at most a very few nucleotides (i.e., 2 to 5). 
This strategy preserves the order of the steps in the linear arrangement of the oligonucleotide chain grown in parallel 
with the oligomer. To preserve the chemical compatibility of the parallel synthetic steps (oligonucleotides and peptides, 
for example), one can modify the standard synthesis chemistries. 

One variation of base-by-base assembly is the block-by-block approach; encoded sets of nucleotides ("codons") 
10 of 5 to 10 or more bases are added as protected, activated blocks. Each block carries the monomer4ype information, 
and the order of addition represents the order of the monomer addition reaction. Alternatively, the block may encode 
the oligomer synthesis step number as well as the monomer-type information. 

One could also attach protected (or unprotected) oligonucleotides containing amplification primer sites, monomer- 
specific information, and order-of-reaction information, from 10 to 50 to 150 bases in length, at each step. At the end 
is of a series of n oligomer synthesis steps, there would be n differently encoded sets of oligonucleotide identifier tags 
associated with each oligomer sequence. After identifying the oligomers with ligand activity, the associated oligonu- 
cleotides are amplified by PCR and sequenced to decode the identity of the oligomer. 

V. Linking the Identifier Tag(s) to the Oligomer 

20 

The identifier tags may be attached to chemically reactive groups (unmasked thiols or amines, for example) on 
the surface of a synthesis support f unctionalized to allow synthesis of an oligomer and attachment or synthesis of the 
oligonucleotide identifier tag. The tags could also be attached to monomers that are incorporated into a small proportion 
of the oligomer chains; or as caps on a small number of the oligomer chains; or to reactive sites on linkers joining the 

25 oligomer chains to the solid support. 

In one embodiment, the solid supports will have chemically reactive groups that are protected using two different 
or "orthogonal" types of protecting groups. The solid supports will then be exposed to a first deprotection agent or 
activator, removing the first type of protecting group from, for example, the chemically reactive groups that serve as 
oligomer synthesis sites. After reaction with the first monomer, the solid supports will then be exposed to a second 

30 activator which removes the second type of protecting group, exposing, for example, the chemically reactive groups 
that serve as identifier tag attachment sites. One or both of the activators may be in a solution that is contacted with 
the supports. 

In another embodiment, the linker joining the oligomer and the solid support may have chemically reactive groups 
protected by the second type of protecting group. After reaction with the first monomer, the solid support bearing the 
35 linker and the "growing" oligomer will be exposed to a second activator which removes the second type of protecting 
group exposing the site that attaches the identifier tag directly to the linker, rather than attachment directly to the solid 
support. 

When activators or deprotection agents are incorporated into the method of preparing a synthetic peptide library 
having a plurality of different members, each member comprising a solid support attached to a different single peptide 

40 sequence and an oligonucleotide identifier tag identifying said peptide sequence, the method comprises: a) apportion- 
ing the solid supports among a plurality of reaction vessels; b) reacting the solid supports with a solution in each reaction 
vessel and treating sequentially with (1 ) a first activator to remove a first type of protective group from the solid support, 
(2) a first amino add or peptide to couple said amino acid or peptide to said solid support at sites where said first type 
of protective group has been removed; (3) a second activator to remove a second type of protective group from the 

45 solid support; and (4) a first nucleotide or oligonucleotide tag to couple said tag at sites where said second type of 
protective group has been removed; c) pooling the solid supports; d) apportioning the pooled solid supports among a 
plurality of reaction vessels; and e) repeating step (b) to couple a second amino acid or peptide and a second nucleotide 
or oligonucleotide tag to said solid support. 

As noted above, the invention can also be carried out in a mode in which there is no solid support, and the tag is 

50 attached directly (or through a linker) to the oligomer being synthesized. The size and composition of the library will 
be determined by the number of coupling steps and the monomers used during the synthesis. Those of skill in the art 
recognize that either the tag or the monomer may be coupled first, in either embodiment. 

Another possible embodiment is the use of two solid supports, such as beads, that are physically linked together, 
one with synthesis sites (or linkers) for the oligomer and one with attachment sites (or linkers) for the identifier tags. 

55 This arrangement allows the segregation of oligomers and identifier tags into discrete "zones" and permits the use of 
widely different chemically reactive groups and chemistries for attachment. The solid supports can be derivatized sep- 
arately and then linked under conditions where all or nearly all of the synthesis solid supports will have a tag-attachment 
solid support in tow. The solid supports can be of different sizes, as for example a large synthesis bead with several 
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(or many) smaller tag-attachment beads linked. In one embodiment, the first solid support will have at least one attached 
amino acid and the second solid support will have at least one attached nucleotide. 

The mode of linking the two beads is constrained by the chemistry of oligomer synthesis. The most obvious means 
of linking the beads is with a heterobif unctional cross-linking agent (for examples of such agents, see Pierce Immu- 
noTechnology Catalog and Handbook pp. E10-E18 (1991)) interacting with the dominant chemically reactive groups 
on each species of solid support. 

VI. Encoding the Identifier Tag Information 

The choice of bases used in an oligonucleotide identifier tag is dictated by the chemistry of oligomer synthesis. 
For example, the use of strong acid to deprotect peptides would depurinate nucleic acids. Therefore, when standard 
chemistries for peptide synthesis are employed, the pyrimidines C and T could be used in a binary code. Thus, in a 
preferred embodiment, the identifier tag will be an oligopyrimidine sequence. 

In another embodiment, the lability of purine nucleotides to strong acid may be overcome through the use of the 
purine nucleoside analogs, such as 7-deaza-2'-deoxyadenosine and 7-deaza-2'-deoxyguanosine (see Barr et al., Bi- 
oTechniques 4 :428-432 (1 986), and Scheit, Nucleotide Analogs: Synthesis and Biological Function pp. 64-65 (John 
Wiley and Sons, New York)). Use of these or other analogs would permit the use of a quaternary or other, as opposed 
to a binary, encoding scheme. 

Information retrieval from oligonucleotide identifier tags is possible through various encryption schemes, two of 
which are described below. In the first, the oligomer sequence information is at least in part encoded in the length of 
the oligonucleotide. Each different monomer added at a given step in the oligomer synthesis may be represented by 
an oligonucleotide tag of unique length. The oligonucleotide inherently contains amplification sites, such as PCR prim- 
ing sequences, characteristic of the given step-number in the oligomer synthesis. Determination of the oligomer com- 
position at any given position in the sequence then involves amplifying the tag using the PCR priming sequence char- 
acteristic for that step in the synthesis and size-separating the amplification products utilizing techniques well known 
in the art, such as gel or capillary electrophoresis (using the tagging oligonucleotides as standards) This embodiment 
is particularly useful when one desires to make a library of compounds related to a lead sequence. One need only tag 
during steps in which a site being analoged is synthesized. 

In addition to length, oligomer sequence information can also be encoded in the sequence of bases comprising 
the oligonucleotide tag. This type of encryption is of value not only in the embodiment in which one attaches a different 
oligonucleotide tag at each coupling step but also in the embodiment in which one extends an oligonucleotide tag at 
each coupling step. For example, as shown in Fig. 4, one may use oligonucleotides of up to about 100 bases (or 
somewhat longer), each having seven regions, as described below. 

Region 1 is a 3'-PCR primer site (20 to 25 bases). This site is used in conjunction with another PCR site (at the 
5'-end of the oligonucleotide) to prime amplification by PCR. Other amplification methods may also be used. 

Region 2 is a "step-specific" DNA sequencing primer site (15-20 bases). This site is specific for the particular 
numbered step in the synthesis series. All the oligonucleotides added to all the beads at a particular step will have this 
sequence in common. Each numbered step will have a highly specific primer site representing that step. 

Region 3 is a spacer (20-30 bases). A spacer segment of variable length, but preferably 20 to 30 bases long, 
places the coding site sufficiently distant from the sequencing primer site to give a good "read" through the monomer 
encoding or identification region. 

Region 4 is a monomer identification region (8 bases). Each base in this string represents one bit of binary code, 
where, for example, T = 0 and C = 1 . Each set of step-specific identifier tags consists of 8 bases with a 1 (C) or a O 
(T) at each of the 8 positions. These may be thought of as switches set to "on" or "off" at the different positions. Each 
monomer type is encoded by a mixture of 1 to 8 of these "switches." 

Region 5 is a step number confirmation region (4 bases plus 2 bases on either side for region distinction). Four 
bits in this short stretch encode the step number. This is redundant to the sequencing primer but can be used to confirm 
that the proper primers were used and that the right step is decoded. 

Region 6 is a repeat of the monomer identification region (8 bases). This region has the same information as region 
4, and is used to confirm monomer identity. Installing this second monomer encoding region also increases the prob- 
ability that a good sequencing "read" will be obtained. 

Region 7 is a 5'-PCR primer site (20 to 25 bases). This site serves as a site for annealing the second PCR primer 
for amplification of the sequence. The length of oligonucleotides with all seven of these features, some of which are 
optional, will commonly be between 75 and 125 bases. 

An 8 bit format can encode 256 different monomer types. The number of steps that can be encoded is determined 
by the number of step-specific sets (8 per set) of oligonucleotides on hand. With 10 sets (80 oligos) one can encode 
up to 256 different monomers assembled into oligomers up to 10 units long (thus providing encoding capability for up 
to 256 10 = 1 2 x 10 24 oligomer sequences). The coded identifier tags may be used so that each monomer is assigned 
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a specific binary number (e.g. Ala = 00000001, Gly = 00000110, etc.). The appropriate oligonucleotides are combined 
to give the correct binary code. 

VII. Recovering and Decoding the Identifier Tag Information 

5 

When specific beads are isolated in a receptor screening experiment, the beads can be segregated individually 
by a number of means, including: infinite dilution, micromanipulation, or preferably, fluorescence activated cell sorting 
(FACS), although, with respect to the present invention, FACS is more accurately 'fluorescence activated oligomer or 
solid support sorting" (see Methods in Cell Biology , Vol. 33 (Darzynkiewicz, Z. and Crissman, H.A., eds., Academic 

io Press); and Dang! and Herzenberg, J. Immunol. Methods 52 :1-14 (1982)). Once the desired beads have been isolated, 
one needs to identify the tag to ascertain the sequence of the oligomer on the bead. 

To facilitate tag identification, one has a variety of options. For instance, one could read the tag directly from the 
bead by sequencing or hybridization, if the tag is an oligonucleotide. One can also amplify oligonucleotide tags to 
facilitate tag identification. The oligonucleotide identifier tags carried by a single solid support or oligomer can be am- 

15 plified in vivo by cloning, or in vitro e.g., by PCR. If the limit of detection is on the order of 100 molecules, then at least 
100 or more copies of each oligonucleotide tag on a bead would be required. Copies of the tag are produced, either 
as single stranded oligonucleotides, double-stranded nucleic acids, or mixtures of single and double-stranded nucleic 
acids, by any of a variety of methods, several of which are described below, and the amplified material is sequenced. 
In the embodiment of the invention in which a separate and distinct oligonucleotide tag is added at each monomer 

20 addition step (as opposed to extending an existing tag at each step), one can amplify all tags at once and then divide 
the amplified material into as many separate sequencing reactions as there were oligomer synthesis steps (employing 
a different sequencing primer for each type of tag). In this embodiment, one could also design the tags so that each 
tag could be amplified separately from the other tags by appropriate choice of primer sequences. The sequencing 
reactions are performed and run on a standard sequencing gel, and the oligomer sequence is deduced from the code 

25 revealed in the resulting sequence information. 

An alternative strategy is to use common PCR primers and common sequencing primers (the sequencing primer 
may even overlap completely or partially with a PCR primer site) and identify the step by hybridization to oligonucleotide 
probes that are complementary to each step-specific sequence in the oligonucleotides from the bead. A single set of 
sequencing reactions is performed on all of the amplified oligonucleotides from a single bead, and the reaction products 

30 are run in a single set of lanes on a gel. The reaction products are then transferred to a suitable hybridization membrane 
and hybridized to a single step-specific probe (see Maniatis et al ., Cold Spring Harbor Laboratory, Cold Spring Harbor, 
NY (1982)). After detection of the resulting signal, the probe is washed from the membrane and another step-specific 
probe is hybridized. One could also use the procedure described in EPO publication No. 237,362 and PCT publication 
No. 89/11548. 

35 Parallel hybridization provides an alternative to sequential hybridization. The sequencing reactions are divided 

into a number of aliquots equal to the number of peptide synthesis steps and run in a separate set of lanes for each 
on the sequencing gel. After transfer of the reaction products to a suitable membrane, the membrane is cut to separate 
the sets of lanes. Each lane set is then hybridized to one of a plurality of step-specific oligonucleotide probes (see 
"Uniplex DNA sequencing" and "Multiplex DNA sequencing," in Plex Luminescent Kits Product Catalog , Bedford, MA, 

40 1990). ~ ^ 

As noted above, a single synthesis solid support (or an attached bead bearing a tag, or in solution in a "well") may 
only comprise a few hundred copies of each oligonucleotide tag. These tags may be amplified, e.g., by PCR or other 
means well known to those skilled in the art, to provide sufficient DNA to be sequenced accurately. The ability to decode 
the oligomers depends on the number of available oligonucleotide identifier tags, the level of amplification that can be 

45 achieved from the available tags, and the accuracy of sequencing that amplified DNA. 

The most commonly used in vitro DNA amplification method is PCR. Alternate amplification methods include, for 
example, nucleic acid sequence-based amplification (Compton, Nature 350:91-92 (1991)) and amplified antisense 
RNA (van Gelderetal., Proc. Nat. Acad. Sci. USA 85:7652-7656 (1988)), and the self-sustained sequence replication 
svstem (3SR, see Guatelli et al, Proc. Natl. Acad. Sci. USA 87: 1874-1878 (1990)). 

50 |f PCR amplification of an oligonucleotide identifier tag is employed, one may encounter "PCR product contami- 

nation," caused by the product of one PCR reaction contaminating a subsequent PCR reaction mixture designed to 
amplify other tags having the same PCR primer binding sites. One may prevent this problem by introducing lability into 
the product sequences and treating subsequent reactions so as to destroy potential contamination carried over from 
previous reactions. A specific example of this strategy, for which commercial kits are sold by PECI and Life Technol- 

55 ogies, is to introduce dUMP into the product. Treating each new PCR reaction with uracil-N-glycosidase degrades any 
dU-containing DNA present, preventing amplification of the contaminant. The template DNA, which contains no dU 
(only dT) is not affected. Of course, the glycosidase is removed or inactivated before amplification is begun. 

Some of the tags described above for peptide synthesis have the unusual characteristic of containing only pyrimi- 
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dines. This means that the uracil glycosidase strategy (Perkin Elmer Cetus Instruments (PECI) Catalog, Alameda 
(1991)) will work on only half of the strands produced-those containing T's (or U's). One cannot introduce dUMP into 
the complementary, purineonly strand; however, the purine strand is highly vulnerable to acid depurination and alka- 
line-mediated scission of the backbone. The combination of these treatments can greatly reduce problems with product 
contamination. Another approach to preventing carryover contamination involves incorporation of a restriction site (Ear l 
could be used for polypyrimidine tags) into the oligonucleotide tag and digestion with the corresponding restriction 
enzyme prior to amplification of a reaction suspected of being contaminated with the tag. This method only works if 
the tag to be amplified will not be cleaved by the enzyme, as would generally be the case for a single stranded oligo- 
nucleotide tag. 

For sequencing amplified DNA, one usually desires to generate single stranded templates. This generation may 
be accomplished by any of several means. One such means is asymmetric PCR, where an excess of one of the primers 
is used to amplify one strand to a level 1 0 to 1 00-fold higher than the other (see, for example, U.S. Patent No. 5,066,584). 
Another means of providing a single stranded template is to by biotinylate one of the primers and purify or remove the 
resulting strand by adsorption to immobilized streptavidin (Pierce Immunotechnoloqy Catalog and Handbook , 1991). 
Yet another means involves generation of RNA transcripts (representing only one of the strands) from an RNA polymer- 
ase promoter and sequencing the transcripts with reverse transcriptase (Sommereta]., Chapter 25, In PCR Protocols: 
A Guide to Methods and Applications , supra ). If the tags are composed of only pyrimidine nucleotides, then all purine 
strands can be eliminated by acid/base treatment, leaving the pyrimidine strand for sequencing. 

The use of separate sequencing primers for each step-specific oligonucleotide requires a separate, conventional 
sequencing reaction for each step-specific primer. Using primers that are differentially labeled would allow the identifier 
tags from a single solid support to be sequenced in a single reaction and run in a single lane set (2 lanes) on a gel. 
There are now commercially available primers labeled with distinguishable fluorophores that are suitable for this pur- 
pose (ABI Catalog, incorporated herein by reference). Sets of chemiluminescent labels now distributed commercially 
may also be used (Bronstein et aj., BioTechniques 8: 310-314 (1990)). 

DNA sequencing enzymes which may be employed in the present invention include Tag DNA polymerase, E. coli 
DNA polymerase I (or the Klenow fragment), T7 polymerase, Sequenase™ and Sequenase II™ (Modified T7 DNA 
polymerases), Bst DNA polymerase, and reverse transcriptase (from AMV, MMLV, RSV, etc., see USB Enzymes for 
DNA Sequencing , U.S. Biochemical Corp, 1991, Cleveland OH). 

The sequence of an oligonucleotide tag may also be identified by a high fidelity DNA hybridization technique. To 
this end, very large scale immobilized polymer synthesis with oligonucleotides may be useful (see PCT patent publi- 
cation Nos. 92/10587 and 92/10588). 

VIII. Screening Receptors with Synthetic Oligomer Libraries 

The tagged synthetic oligomer libraries of the present invention will have a wide variety uses. By way of example, 
these libraries can be used in determining peptide and nucleic acid sequences that bind to proteins, finding sequence- 
specific binding drugs, identifying epitopes recognized by antibodies, and evaluating a variety of drugs for clinical and 
diagnostic applications, as well as combinations of the above. Peptides as short as about five amino acids long might 
be useful in receptor-binding studies, for example. 

Synthetic oligomers displayed on small beads can be screened for the ability to bind to a receptor. The receptor 
may be contacted with the library of synthetic oligomers, forming a bound member between an receptor and the oligomer 
able to bind the receptor. The bound member may then be identified. As one example, the receptor may be an immu- 
noglobulin. 

The techniques for selection of individual beads displaying ligands on their surface are analogous to FACS methods 
for cloning mammalian cells expressing cell surface antigens or receptors. Therefore, methods for selecting and sorting 
beads will be readily apparent to those skilled in the art of cell sorting. For example, a receptor can be labelled with a 
fluorescent tag and then incubated with the mixture of beads displaying oligomers. After washing away unbound or 
non-specifically bound receptors, one can then use FACS to sort the beads and to identify and isolate physically indi- 
vidual beads showing high fluorescence. 

Alternatively, affinity adsorption techniques can be employed in conjuction with the libraries of the invention. The 
mixture of beads can be exposed to a surface on which a receptor has been immobilized (see PCT patent publication 
No. 91/07087, incorporated herein by reference). After washing to remove unbound beads, one can then elute beads 
bound to the surface using conditions that reduce the avidity of the oligomer/receptor interaction (low pH, for example). 
The process of affinity adsorption can be repeated with the eluted beads, if desirable. Finally, individual beads are 
physically separated, for example, by limited dilution, by FACS, or by methods similar to those in which cells are incu- 
bated with a receptor coupled to small superparamagnetic beads and then cells expressing a ligand for the receptor 
are extracted using a high power magnet (see Miltenyi et al., Cytometery 11:231-238 (1990)). Magnetically selected 
cells can be further analyzed and sorted using FACS. Radionucleotides may also serve to label a receptor. 
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Alternatively, the present invention can be used to generate libraries of soluble tagged oligomers, which can be 
used with a variety of screening methods. For instance, the oligomer library can be synthesized on beads with an 
identifying tag encoding the oligomer sequence. The microscopic beads are placed in individual compartments or wells 
that have been "nanofabricated" in a silicon or other suitable surface. The oligomers are cleaved from the beads and 
5 remain contained within the compartment along with the bead and the attached identifier tag(s). In one embodiment, 
the bottom surface is coated with the receptor, and after the addition of binding buffer and a known ligand for that 
receptor that is fluorescently labelled, one effectively has a solution phase competition assay for novel ligands for the 
receptor. The binding of the fluorescently labelled ligand to the receptor is estimated by confocal imaging of the mon- 
olayer of immobilized receptor. Wells with decreased fluorescence on the receptor surface indicate that the released 
10 oligomer competes with the labelled ligand. The beads or the tag in wells showing competition are recovered, and the 
oligonucleotide tag is amplified and sequenced to reveal the sequence of the oligomer. 

The beads are loaded in the wells by dispersing them in a volume of loading buffer sufficient to produce an average 
of one bead per well. In one embodiment, the solution of beads is placed in a reservoir above the wells, and the beads 
are allowed to settle into the wells. Cleavage of the oligomers from the beads may be accomplished using chemical 
is or thermal systems, but a photodeavable system is preferred. 

Recovery of identifier-tagged beads from positive wells may be effectuated by a micromanipulator plucking out 
individual beads. However, a preferred mode involves the use of beads that have been previously labelled with a 
fluorescent tag. A laser of the appropriate wavelength is then used to bleach the resident beads in only the positive 
wells. All the beads are then removed en masse and sorted by FACS to identify the bleached positives. The associated 
20 tags may then be amplified and decoded. 

In a variation of this assay, the oligomer and tag may be synthesized attached to a common linker, which, in turn, 
is bound to the solid support. After placing the beads in the wells, one can cleave the linker from the bead, producing 
a tagged oligomer in solution. An immobilized receptor, such as a receptor bound to a bead or a receptor immobilized 
on one surface of the well, can be screened in a competition assay with the oligomer and a fluorescently labeled ligand. 
25 Instead of recovering the beads, one may recover the beads bearing immobilized receptors and sort the beads using 
FACS to identify positives (diminished fluorescence caused by the library oligomer competing with the labeled ligand) 
or one can determine the fluorescence emitting from the well surface coated with receptor. The associated identifier 
tag may then be amplified and decoded. 

In a third variation of this approach, soluble tagged oligomers, produced either by cleavage of the linked oligomer 
30 and tag from the solid support as described above, or synthesized by the VLSIPS™ method described above, or 
synthesized in solution without a solid support, are incubated with an immobilized receptor. After a wash step, the 
bound, tagged oligomers are released from the receptor by, e.g., acid treatment. The tags of the bound oligomers are 
amplified and decoded. 

3S ix. An Automated Instrument for Oligomer Synthesis and Tagging 

The coupling steps for some of the monomer sets (amino acids, for example) require a lengthy incubation time, 
and a system for performing many monomer additions in parallel is desirable. This can be accomplished with an au- 
tomated instrument able to perform 50 to 100 parallel reactions (channels). Such an instrument is capable of distributing 

40 the reaction mixture or slurry of synthesis solid supports, under programmable control, to the various channels for 
pooling, mixing, and redistribution. 

Much of the plumbing typical of peptide synthesizers is required, with a large number of reservoirs for the diversity 
of monomers and the number of tags (up to 80 for a 10 step synthesis, in one embodiment) employed. The tag dis- 
pensing capability will translate simple instructions into the proper mixture of tags and dispense that mixture. Monomer 

45 building blocks will also be dispensed, as desired, as specified mixtures. Reaction agitation, temperature and time 
control may be provided. An appropriately designed instrument may also serve as a multi-channel peptide synthesizer 
capable of producing 1 to 50 mgs (crude) of up to 100 specific peptides for assay purposes. See PCT patent publication 
91/17823. 

50 EXAMPLE I. SYNTHESIS ON GLASS BEADS OF 4 FLUORESCENTLY TAGGED PENTAPEPTIDES 

A. Derivatization of Glass Beads 

About 0.5 g of 3-10um (um)diameter silica beads (Polyscience) were washed with refluxing 10% aqueous HN0 3 
55 for 20 min. The beads were pelleted and washed with distilled water (5x) and methanol (3x) and dried at 125°C for 1 2 
hours. Beads were vortexed with a 5% solution of aminopropyltriethoxysilane in acetone for 10 hours, pelleted and 
then washed with acetone (2x), ethanol (5x), and methylene chloride (2x) and dried at 125°C for 45 min. Beads were 
suspended in dry DMF (1 ml) containing diisopropylethylamine (17 uJ, 100 umoles) and a solution of Fmoc-b-alanine, 
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pentafluorophenyl ester (200 mg, 420 u/noles, Peninsula Labs) in distilled water (1.5 ml) was added. After vortex 
treatment for 11 hours, the beads were pelleted and washed with DMF (3x) and methylene chloride (2x). Beads were 
treated with a 10% solution of acetic anhydride in DMF containing 0.05 mol of 4-dimethylaminopyridine to cap any 
underivatized aminopropyl groups, and then washed with DMF (2x) and methylene chloride (2x). Beads were vortexed 
5 with a 20% solution of piperidine in DMF and the release of the Fmoc-piperidine adduct quantitated by monitoring the 
absorbance spectrum of the supernatant at 302 nm (e 302 = 7800 M" 1 cnrr 1 ). An estimate of the degree of substitution 
of 10 ujnoles of amino grcups/g beads was thus obtained. Finally, the beads were washed with ethanol (5x) and meth- 
ylene chloride (2x) and then dried at 85° C for 12 hours. 

10 B. Preparation of Boc-Gly-L-Phe-L-Leu-OH 

Glycyl-L-Phenylalanyl-L-leucine (552 mg, 15 mmol, Bachem) was dissolved in a solution containing distilled water 
(10 ml) and 1 M NaOH (1 .5 ml). The solution was cooled in an ice bath and was treated with a solution of di-tert utyl 
pyrocarbonate (337 mg, 1 .5 mmol) in p-dioxane (12 ml). A white precipitate rapidly formed but redissolved after stirring 
is at room temperature for 4 hours. The solution was concentrated to dryness in vacuo, the residue taken up in water (5 
ml), and the pH adjusted to 2.5 by the addition of 1 M KHS0 4 . The aqueous suspension was extracted with EtOAC 
(2x,15 ml), the organic layer separated, and dried over MgS0 4 . After removal of the solvent in vacuo , the residue was 
triturated with hexane to afford Boc-Gly-L-Phe-L-Leu-OH as a white solid (yield=642 mg, 98%). 

20 C Preparation of Gly-L-Phe-L-Leu Beads 

Boc-Gly-L-Phe-L-Leu-OH (44 mg, 0.1 mmol), benzotriazol-1-yloxytris(dimethylamino)phosphonium hexafluoro- 
phosphate (44 mg, 0.1 mmol) and 1- hydroxy be nzotriazole hydrate (14 mg, 0.104 mmol) were dissolved in dry DMF (1 
ml). Diisopropylethylamine (20 uJ, 0. 1 1 5 mmol) was then added and 0.65 ml of this solution was immediately transferred 
25 to a microcentrifuge tube containing 80 mg of amino derivatized glass beads. The sealed tube was vortexed for 3.5 
hours, and the beads were then pelleted and washed with DMF (3x) and methylene chloride (2x). The beads were 
then deprotected with a 50% solution of trifluoroacetic acid in methylene chloride for 30 min., washed with methylene 
chloride (2x), ethanol (2x), and methylene chloride (2x), and dried at 55°C for 1 hour. 

30 D. Preparation of Gly-Gly-L-Phe-L-Leu Beads (SEQ ID NO:10) 

Fmoc-glycine pentafluorophenyl ester (46 mg, 0.1 mmol) was dissolved in dry DMF (1 ml) containing diisopropyl- 
ethylamine (17 u.l, 0.1 mmol). About 0.65 ml of this solution was added to 20 mg of Gly-L-Phe-L-Leu beads in a micro- 
centrifuge tube, and the tube was vortexed for 3 hours. The beads were pelleted and washed with DMF (4x) and 
35 methylene chloride (2x). Deprotection was effected by treatment with a 20% solution of piperidine in DMF for 30 min. 
The beads were washed with DMF (2x), ethanol (2x), and methylene chloride (2x) and dried at 60°C for 4 hours. 

E. Preparation of L-Pro-Gly-L-Phe-L-Leu Beads (SEQ ID NO:11) 

40 Fmoc-L-prol in e pentafluorophenyl ester (50 mg, 0.1 mmol) was dissolved in dry DMF (1 ml) containing diisopro- 

pylethylamine (17 uJ, 0.1 mmol). About 0.65 ml of this solution was added to 20 mg of Gly-L-Phe-L-Leu beads in a 
microcentrifuge tube, and the tube was vortexed for 3 hours. The beads were pelleted and washed with DMF (4x) and 
methylene chloride (2x). Deprotection was effected by treatment with a 20% solution of piperidine in DMF for 30 min. 
The beads were washed with DMF (2x), ethanol (2x), and methylene chloride (2x) and dried at 60 °C for 4 hours. 

45 

F. Fluorescein Staining of Gly-Gly-L-Phe-L-Leu Beads 

About 5.4 mg of Gly-Gly-L-Phe-L-Leu beads were suspended in 450 jil of aqueous borate buffer (pH 8.5) and 54 
u.l of a 10 u.M solution of fluorescein isothiocyanate (FITC) added. After vortex treatment for 1 .5 hours, the beads were 
50 washed with buffer (5x), ethanol (2x), and methylene chloride (2x). FACS analysis indicated that approximately 10% 
of available amino groups had been titrated with FITC. 

G. Co-coupling of L-Tyrosine and Biotin to Mixture of L-Pro-Gly-L-Phe-L-Leu and FITC labelled Gly-Gly-L-Phe-L-Leu 
Beads 

55 

5 mg of FITC labelled Gly-Gly-L-Phe-L-Leu beads and 5 mg L-Pro-Gly-L-Phe-L-Leu beads were mixed together 
in a single tube, vortexed with a 0.1 mM solution of diisopropylethylamine in methylene chloride, and the suspension 
was divided into two equal portions. The beads were pelleted, and to one portion was added a solution containing 
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Fmoc-O-tert-butyl-L-tyrosine pentafluorophenyl ester (59 mg, 95 jimol), N-hydroxysuccinimidobiotin (1.7 mg, 5 u/nol) 
and diisopropylethylamine (17 jil, 100 jimol) in dry DMF (1 ml). After vortexing for 3 hours the beads were washed with 
distilled water (2x), e anol (2x), methylene chloride (2x) and DMF (1x). Fmoc deprotection was effected by treatment 
with a 20% solution of piperidine in DMF for 30 min., and tert -butyl side chain protecting groups were removed by 
s treatment with 25% trifluoroacetic acid in methylene chloride for 30 min. The pelleted beads were washed with meth- 
ylene chloride (2x), ethanol (2x), and TBS (1x). 

H. R-Phycoerythrin Staining of Biotinylated L-Tyr(Glv/L-Prol-Gly-L-Phe-L-Leu Beads (Mixture of SEQ ID NO:12 and 
SEQ ID NO: 13) 

10 

Biotinylated L-tyrosine beads from (G) above were suspended in TBS (0.5 ml) and treated with 10uJ of R-phyco- 
erythrin-avidin conjugate (Molecular Probes) for 30 min. Pelleted beads were washed with TBS (5x). 

I. Co-coupling of L-Proline and Biotin to Mixture of L-Pro-Gly-L-Phe-L-Leu and FITC labelled Gly-Gly-L-Phe-L-Leu 
is Beads (Mixture of SEQ ID NO:15 and SEQ ID NQ:14) 

5 mg of a mixture of L-Pro-Gly-L-Phe-L-Leu and FTTC labelled Gly-Gly-L-Phe-L-Leu beads were treated with a 
solution containing Fmoc-L-proline pentafluorophenyl ester (48 mg, 95 umol), N-hydroxysuccinimidobiotin (1.7 mg, 5 
umol), and diisopropylethylamine (17 u.l, 100 u.mol) in dry DMF (1 ml). After vortex treatment for 3 hours, the beads 
20 were washed with DMF (2x), ethanol (2x), methylene chloride (2x), and DMF (1x). Fmoc deprotection was effected by 
treatment with a 20% solution of piperidine in DMF for 30 min., and by way of control, the beads were treated with 25% 
trifluoroacetic add in methylene chloride for 30 min. The pelleted beads were washed with methylene chloride (2x), 
ethanol (2x), and TBS (1x). 

25 J. Tri-Color Staining of Biotinylated L-Pro-(Gly/L-Prol-Gly-L-Phe-L-Leu Beads 

Biotinylated L-proline beads from (i) above are suspended in TBS (0.5 ml) and treated with 20 uJ Tri-Color: strepta- 
vidin conjugate (Caltag Labs) for 30 min. Pelleted-beads are washed with TBS (5x). 

30 K. Selection of Beads Containing Peptide Ligands for Monoclonal Antibody 3E7 

Monodonal antibody 3E7 was raised against the opioid peptide beta-endorphin. The binding specificity of MAb 
3E7 has been well characterized by solution assays with chemically synthesized peptides. The equilibrium binding 
constants (Kd) of the peptides considered here are as follows: YGGFL is 6.6 nM; and YPGFL, PPGFL, and PGGFL 

35 are each >1 mM; thus, only the peptide YGGFL shows appreciable affinity for the antibody. 

A mixture of beads containing either YGGFL, YPGFL, PGGFL, or PPGFL and their respective tags (see above) 
are added in phosphate buffered saline (PBS) containing monoclonal antibody 3E7 that has been previously conjugated 
to colloidal superparamagnetic microbeads (Miltenyi Biotec, West Germany). After a 16h (hr) incubation at 4°C, beads 
which bind the 3E7 antibody are selected using a high strength magnet. The selected beads are then analyzed by flow 

40 cytometry. Analysis of the selected beads reveals that they contain both fluorescein and R-phycoerythrin, indicating 
that only beads displaying the peptide YGGFL are selected by the 3E7 antibody. 

EXAMPLE 2: SYNTHESIS ON GLASS BEADS OF 4 PENTAPEPTIDES TAGGED WITH OLIGONUCLEOTIDE 
IDENTIFIERS 

45 

A. Synthesis of Identifier Oligonucleotides (l)-(IV) 

The oligonucleotide identifier tags (l)-(IV) have the sequences shown below. The regions complementary to the 
5' and 3' PCR primers are underlined. The regions complementary to the step-specific sequencing primers are shown 
so in lower case: there are two steps in this example. The monomer encoding region is shown in bold type: CT 7 encodes 
Gly, TCT € encodes L-Pro, and TTCT 5 encodes L-Tyr in this case. Thus oligos (l)-(IV) code respectively for Gly in 
position 2, L-Pro in position 2, L-Tyr in position 1, and L-Pro in position 1. 



55 
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(I) 5MrtB2^ 1T1 CI A CCTCTC^ 1 1 1 1CTC 

CTTCTTTTTTTCTCTCCCT 

cH-rT^CCTCTCrCTCTrCTTTCC^' (SEQIDNOrl) 



(II) 5'-BlB2^JlTlll ICCTCTCCCTCTTITCTCCTCT ACl A 1 1 1 1CTC 

ci i rci a a 1 1 1 CTCTCCcrcTxrrccTCTcrccccm 

r trrTCTCCTCTCTCTCA III 1 1CC -3' (SEQIDNOS) 



(HI) 5 -BlB2-CnTC 1 1 CCTCTCCCTCTTTTCTCCTCTTTCTTTTTCTC 
CTTTTCTTTTI I ACTCCCTCTCTCCTCrCTCtcttcct^ 
ctrtctCCTCTCCTCTCTCTCTTCTTTCC-3* (SEQIDNOS) 



(IV) 5-BlB2-CriTC riCCTCTCCCT CI riT CTCCT l'l ACITA'A'A A CTC 
CTTTCTTTTTTCTCTCCCTC 

ctrtctCCTCTCCTCTCTCT CITCTTTCC «3' (SEQIDNO:4) 

where: B 1 =p-Maleimido-C 6 H4-(CH 2 )3-C(0)NH-(CH 2 )6-0-P0 2 -0-, and B 2 = CH 2 -CH[(CH 2 ) 4 -NH-Biotin]-CH 2 -0-P0 2 - 
O-. 

Oligos (l)-(IV) are synthesized on an ABI PCR-mate synthesizer using commercially available (Sigma) DMT-O- 
Me phosphoramidites. The N4-amino group of cytidine is protected as the benzoyl derivative. The 5* terminal (B1 ) and 
penultimate (B2) phosphoramidites are respectively N-MMT-C 6 -AminoModifer (Clonetech) and Biotin Phosphoramidite 
(Glen Research) for each oligonucleotide. The fully protected O-methyl phosphotriester oligomers are cleaved from 
the CPG support by treatment with concentrated NH40H at 25°C for 1 hour. The crude products are purified by affinity 
chromatography on a monomeric avidin-agarose column (Pierce), and the full-length material is eluted with 2 mM 
biotin. The 5'-MMT group is removed by treatment with 80% acetic acid for 1 hour at 25° C, and the solution is evaporated 
to dryness. The products are dissolved in PBS, pH 8.0, and treated with a 50-fold excess of succinimidyl 4-(p-male- 
imidophenyl) butyrate (Pierce) in DMF for 30 min. The modified, protected oligonucleotides are desalted by RP-HPLC, 
lyophilized and stored under nitrogen. 

The primers used for PCR and sequencing are prepared in the normal fashion and are shown below: 

5' PCR Primer S'-TCCTCTCCCTCrnTCTCCTCT-B* (corresponds to 
bases 7-28 of SEQ ID NO:l) 



3' PCR Primer 5-Biotin-GGAAAGAAGAGAGAGAGGAGAGG-3' (SEQ 

ID NO:5) 

Step #1 Sequencing Primer 5-AGAGAGGGGAAAGGAAGA-3* (SEQ ID 

NO:6) 
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Step #2 Sequencing Primer 5-AGGAAAGGAGAGAAAGGG-3* (SEQ ID 

NO:7) 

5 B. Preparation of Gly-Gly-L-Phe-L-Leu Beads Bearing Identifier Oligo (I) 

5 mg of Gly-L-Phe-L-Leu beads are treated with a solution containing Fmoc-Gly-OH (99.95 jomol), Fmoc-Cys 
(Npys)-OH (0.05 ujtioI, Calbiochem), ben20triazol-1-yloxytris-(dimethylamino)phosphonium hexafluorophosphate 
(100 fimol), 1 -hydroxybenzotriazole hydrate (100 umol), and diisopropylethylamine (150 umol) in dry DMF (1 mL) for 
w 2 hours. The beads are washed with DMF (2x) and then with methanol (2x) and then treated with a 1 0 mM DTT solution 
in methanol for 30 min. to deprotect the cysteine residues. The beads are quickly washed with ice-cold methanol (2x), 
pelleted, and then reacted for 20 min with 100 ui of a 0.1 mM solution of oligo (I) in methanol. After washing with 
methanol (2x) and then with DMF (2x), the beads are deprotected for 20 min. with 20% piperidine in DMF. Finally, the 
beads are washed with DMF (2x), methanol (2x), and then methylene chloride (2x) and dried at 45°C for 1 hour. 

15 

C Preparation of L-Pro-Glv-L-Phe-L-Leu Beads Bearing Identifier Oligo (II) 

5 mg of Gly-L-Phe-L-Leu beads are treated as in (b) above, substituting Fmoc-L-Pro-OH and Oligo (II) for Fmoc- 
Gly-OH and Oligo (I), respectively. 

20 

D. Preparation offOtBu)-L-Tyr-(Glv/L-Pro)-Glv-L-Phe-L-Leu Beads Bearing Identifier Oligos (m and l/ll) 

Beads from (b) and (c) are pooled and divided into two equal portions. One portion is treated as in (b), substituting 
Fmoc(OtBu)-L-Tyr-OH and Oligo (III) as appropriate. 

25 

E. Preparation of L-Pro-(Gly/L-Pro)-Gly-L-Phe-L-Leu Beads Bearing Identifier Oligos (IV and l/ll) 

The second pool is treated as before, substituting Fmoc-L-Pro-OH and Oligo (IV) as appropriate. 

30 f. Reconstitution and Deprotection of the Peptide Library 

Beads from (d) and (e) are pooled, and the phosphate, amino acid side-chain, and nucleotide exocyclic amino 
protecting groups are removed as follows. A one hour treatment with a 1:2:2 mixture of thiophenol: triethylamine: p- 
dioxane is followed by washing the beads with methanol (2x) and then methylene chloride (2x), and then the beads 
35 are treated for 5 min. with 95:5 trifluoroacetic acid: ethanedithiol. After a wash with methanol (3x), the beads are treated 
at 55°C with 1:1 ethylenediamine: ethanol for 1 hour and then washed first with ethanol (2x) and then with PBS (2x). 
This collection of beads constitutes the library and contains approximately equal quantities of the 4 immobilized peptides 
YGGFL, YPGFL, PGGFL and PPGFL. Additionally, each bead carries two distinct 113 bp oligonucleotide sequences 
encoding the identities of both the first and second amino acids of the peptide on that bead. 

40 

G. PCR Amplification of Oligonucleotide Identifier Tag 

After a FAC sort of affinity purified beads into individual 0.5 mL polypropylene tubes, 25 u.l of TBS containing 0.1 
u.g salmon sperm DNA (as carrier) are added together with 25 uJ of 2X PCR Amplification Buffer (PECI) to each tube. 

45 The 2X buffer contains: 100 mM KCI; 20 mM Tris-CI, pH 8.4, 20°C; 6 mM MgCI 2 ; 0.4 mM dNTP's; 1 uM of 5' PCR 
primer; 1 uA/l of 3' PCR primer; and 100 units/ml Taq DNA polymerase. 

After buffer addition, the sample is covered with 50 uJ of mineral oil and transferred to an automated thermal cycler. 
In the thermal cycler, the samples are heat denatured at 95°C for 2 min., and then cycled 35 times through 3 steps: 
95° C/30 s (sec), 60° C/1 min., 72° C/1 min., which steps are followed by an in cuba on at 72° C for an additional 5 

so min. and then the tubes are cooled and held at 1 5° C until ready for processing on streptavidin beads. The mixture is 
heated to 95°C to denature the strands, and the biotinylated purine strand and excess 3' PCR primer are removed by 
addition of streptavidin -coated beads. The tubes are centrifuged at 300s* 1 for 5 min. The supernatant is used in the 
sequencing reactions, as described below. 

55 H. Sequencing of PCR Amplified Oligonucleotide Tags 

The amplified oligonucleotides from individual bead isolates are sequenced in a pair of reactions (using ddA or 
ddG as chain terminators) with either the Step #1 -specific or the Step #2 -specific sequencing primers. 
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To anneal the template and primer, for each set of two sequencing lanes, a single annealing and subsequent 
labeling reaction is run by combining 8.5 uJ of sequencing primer (conc.= 0.25 (imol/jil), 1.5 uJ Sequenase™ 5X se- 
quencing buffer (200 mM Tris HCI, pH 7.5; 100 mM MgCI 2 ; and 250 mM NaCI), and 10 u.1 of template DNA from the 
amplification supernatant above. The samples are heated for 2 minutes at 65° C and allowed to cool slowly to room 
5 temperature (approx. 10 minutes). 

The labeling reaction is performed as follows. Sequenase™ (v2.0) is diluted 1 : 20 with TE (10 mM Tris HCI, pH 
7.5; and 1 mM EDTA), and a labeling cocktail containing a 2 : 3.5 ratio of diluted enzyme to labeling mix (i.e., a 4 : 2: 
1 mixture of 1 50 nM dGTP, 0. 1 M dithiothreitol, alpha-^S-dATP, >1 000 Ci/mmol) is prepared. About 5.5 uJ of the cocktail 
are incubated with 10 uJ of annealed template/primer (from (i)) at 25°C for 5 min. 
10 The termination reactions are performed as follows. 6 uJ of labeling reaction mixture are added to 5 u,l of each of 

the appropriate ddXTP termination reaction mixes (i.e., 80 jiM dGTP, 80 u.M dATP, 50 mM NaCI, and 8 jiM ddGTP or 
8 u.M ddATP). After incubation at 37° C for 5 min., about 8 uJ of Stop Solution (95% formamide, 20 mM EDTA, 0.05% 
bromophenol blue, and 0.05% xylene cyanol) are added to each of the termination reactions. 

The sequencing gel is comprised of 6% total acrylamide (1 9:1 acrylamide/bis), 0.09 M Tris base, 0.09 M boric acid, 
15 1 mM EDTA, and 7 M urea. The gel is polymerized by addition of 1 .9 uJ of 25% ammonium persulfate per ml and 0.72 
u.l of TEMED per ml of above gel solution. The gel is allowed to polymerize at least one hour and is prerun at least 20 
minutes prior to sample loading. Gel plates are then maintained between 40 and 50°C prior to and during the run. 

Reactions are heated to 85-95°C for 2 minutes prior to loading, and the gel is run until the bromophenol blue dye 
reaches the bottom of the gel. The sequences of interest run between the bromophenol and xylene cyanol markers. 
20 The information required to identify the sequence of monomers in the oligomers attached to the bead is contained in 
the DNA sequence information. 

EXAMPLE 3: PARALLEL SYNTHESIS OF PEPTIDES AND OLIGONUCLEOTIDE TAGS ON CARBOXYL BEADS 

25 A. Synthesis of Phosphoramidites (I HIV) 

The 3'-(allyl N.N'diisopropyl-phosphoramidites) of 5'-DMT derivatives of: (1) N 6 -(alryloxy)carbonyl-7-deaza-2'-de- 
oxyadenosine; (2) N4-(allyloxy)carbonyl-2'-deoxy-cytidine; (3) N2-(allyloxy)carbonyl-7-deaza-2'-deoxyguanosine; and 
(4) thymidine (see Figure 6), are prepared according to the procedures of Hayakawa et al., J. Amer. Chem. Soc. 112: 
30 1691-1696 (1990)). 

B. Derivatizing Carboxyl Beads With a Diamine Linker 

Preparation of a bifunctional bead material for parallel synthesis of peptides and oligonucleotides is illustrated in 
35 Figure 7. Three 50 mg aliquots of 4.5 u.m diameter polystyrene/polydivinylbenzene/polymethylmethacrylate/COOH 
beads (Bangs' Laboratories) were each placed in a separate microcentrifuge and treated as follows. First, the beads 
were treated with 0.1 N aqueous HCL (3 ml) and stirred by vortexing for 15 minutes. The beads were then pelleted 
with a microcentrifuge, the liquid supernatant decanted, and the remaining bead pellet successively washed (by vor- 
texing, pelleting, and decanting: a process referred to as "washed") with water (3 x 1 ml) and dimethylformamide (DMR 
40 3x 1 mL). 

The compounds 2-(1H-benzotriazol-1-yl)-1 ,1,3,3-tetramethyluronium hexafluorophosphate (HBTU, 38 mg, 0.10 
mmol), 1 -hydroxy benzotriazole (HOBT, 15mg, 0.10 mmol), and DMF (0.5 ml) or dichloromethane (0.5 ml) were added 
to the bead pellet. Diisopropylethylamine (DIEA, 54 uJ, 030 mmol) was added, and the suspension was vortexed for 1 
min. The compound 4,9-dioxa-1 ,12-dodecanediamine (20 uJ, 0.10 mmol) was then added, and the reaction was vor- 
45 texed for 30 minutes. The reaction was then diluted with DMF (1 ml), the beads were pelleted, and the supernatant 
decanted. The pellet was treated with 9:1 DMF/water (1.0 ml) and vortexed for 15 minutes. The beads were then 
pelleted, the supernatant decanted, and the beads washed with DMF (3 x 1.0 ml). 

C. Attaching Peptide and Oligonucleotide Synthesis Linkers 

so 

100 mg of the beads are treated with a mixture of 4-Fmoc-aminobutyric acid (0.1 mmol) and 4-p,p'-dimethoxytrityl 
(DMT) -hydroxy butyric acid (0.1 u.mol) in the presence of HBTU (0.1 mmol), HOBt (0.1 mmol), and DIEA (0.1 mmol) in 
9:1 CH 2 CI 2 :DMF (1 .0 ml). After vortex treatment for 30 minutes, the reaction mixture is diluted with DMF (1 .0 ml), the 
beads pelleted, and the supernatant decanted. The beads are washed with DMF (3 x 1 .0 ml). The coupling procedure 
55 js then repeated with fresh reagents, and the beads are pelleted and washed as described above. 
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D. Building a 3' PCR Priming Site on the Hydroxy Linkers 

The parallel assembly of oligonucleotide-tagged peptides on beads is illustrated in Figure 8. A PCR priming site 
of 20-25 nucleotides is assembled as follows. Note that all reagents used are anhydrous, and reactions occur under 

5 an atmosphere of dry argon. About 10 mg of the beads are subjected to an eight-step reaction sequence to couple a 
protected phosphoramidite. The reaction steps are: (1) beads are washed for 05 minutes with acetonitrile (MeCN); (2) 
DMT groups are removed with 3% trichloroacetic acid in CH 2 CI 2 for 15 minute; (3) beads are washed with MeCN for 
3 minutes; (4) beads are treated with 0.1 M phosphoramidite (I, II, III, or IV) in MeCN containing either 0.5 M (4-nitro- 
phenyl) tetrazole or 0.5 M pyridinium hydrochloride and 1.0 M imidazole for 2 min.; (5) beads are washed with MeCN 

10 for 0.5 minutes; (6) beads are capped with a mixture of Ac20/2,6-lutidine/THF (1 :1 :8) containing 5% DMAP; (7) beads 
are oxidized with 1 M ! BuOH in CH 2 CI 2 for 0.8 minutes; and (8) beads are washed for 0.5 minutes with MeCN. Steps 
one through eight are repeated from one to 25 times to assemble a PCR priming site of up to 25 nucleotides. 

E. Coupling of First Amino Acid to Amino Linkers 

15 

Peptide and nucleotide couplings may be alternated, as illustrated in Figure 8. To couple an amino acid (or peptide), 
the Fmoc group is first removed from the beads by treatment with 30% piperidine in DMF for 60 min. The beads are 
washed 3 times with DMF. The beads are then treated with a solution containing the appropriate amino acid (0.1 M), 
HBTU (0.1 M), HOBt(0.1 M), andDIEA(0.1 M) ing:1 CH 2 CI 2 :DMF for 30 min. The coupling is then repeated with fresh 
20 reagents for a further 30 min. and the beads are washed with DMF (3x) and then with MeCN (3x). 

F. Construction of First Oligonucleotide "Codon" 

A "codon" of about 3 to 5 nucleotides uniquely representing the identity of the first amino acid is then built at the 
25 5' end of the oligonucleotide chain using the 8-step coupling cycle in procedure (d) above. 

G. Coupling of Subsequent Amino Acids and "Codon" Construction 

The methods of procedures (e) and (f) are then repeated using the appropriate amino acid and nucleotide building 
30 blocks until the desired peptide and the oligonucleotide coding region are completely assembled. 

H. Construction of a 5' PCR Priming Site 

The 8-step coupling cycle of procedure (d) is used to build a 20-25 nucleotide PCR priming site on the 5' terminus 
35 of the oligonucleotide chains. 

I. Deprotection of the Oligonucleotide and Peptide Chains 

The fully assembled peptide and oligonucleotide chains are deprotected as follows. The amino-terminal Fmoc 
40 groups are removed by treatment with 30% piperidine in DMF and then a wash with THF (3x). To remove the allylic 
protecting groups, the beads are treated with a THF solution containing tris(dibenzylideneacetone) dipalladium-chlo- 
roform complex (0.02 M), triphenylphosphine (0.2 M), and 1:1 n-butylamine/formic acid (1 .2 M) at 50°C for 30 min. and 
the pelleted beads are washed with THF. The beads are washed with 0.1 M aqueous sodium N,N-diethyldithiocar- 
bamate and then water to remove traces of palladium. The amino acid protecting groups are then removed by treatment 
45 with 95:5 TFA/water for 30 min. "Scavenger" reagents such as 1 ,2-ethanedithiol and thioanisole may also be included 
in this acidic deprotection medium (e.g., 2% of each by volume). Finally, the fully deprotected beads are washed with 
aqueous buffer and are ready for interaction with a biological receptor. 

EXAMPLE 4: LIBRARY PREPARATION AND SCREENING 

50 

In this example, two populations of amine derfvatized beads were labeled with oligonucleotides possessing base 
sequences uniquely characteristic of each bead population. The population labeled with an oligonucleotde 95 bases 
in length (95 mer) was subsequently coupled to the peptide YGGFL. The population of beads labeled with an oligonu- 
cleotide 110 bases in length (110 mer) was coupled to phenyalanine (F). The beads were then mixed in the ratio of 
55 twenty F/1 1 0 mer beads for each YGGFL/95 mer bead and stained with a f luorescently labeled antibody 3E7 that binds 
the peptide YGGFL with high affinity. Individual fluorescently stained beads could then be sorted by FACS directly into 
PCR tubes. After PCR, 5 of 6 fluoresently stained beads gave rise to a fragment of amplified DNA 95 bp long. PCR of 
the remaining single bead gave rise to small DNA fragments, possibly being primer dimer. 
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The oligonucleotides used in this experiment are the two tags, two PCR primers, and one sequencing primer. The 
same PCR and sequencing primers were used for the two tags. The two tags differ in their sequence and length. Both 
tags were composed of the bases 7-deazaA, C, and T. 

The 95 mer tag has the sequence: 

CCA CTC A CT ACC ACT CTA CTA TA A CCA CCC CTT CCT ATT CCA AAA TTA 
CAA Act tat etc aac tac ate tCA C AC TC A C TC A T C TCT ACA T CT AC (SEQ ID 
NO:8) 

The 110 mer tag has the sequence: 

CCA CTC ACT ACC ACT CTA CTA TA A CCC TCC CCT ATT CCA AAA TTA CAT 
CCT ATT CCA AAA TTA CAA Act tat etc aac tac ate tCA CAC TCA CTC ATC TCT 



ACA TCT AC (SEQ ID NO:9) 

For each target the underlined sequences represent PCR primer binding sites: the sense primer is at the 5' -end, and 
the anti-sense primer is at the 3' -end. Also, for each target the small case sequence represents the sequencing primer 
binding site. 

A. Bead Preparation 

Beads were purchased from Bang's Laboratories (979 Keystone Way, Carmel, IN 46032) and are composed of 
carboxylated polystyrene (4.5 |im average diameter). These beads were subjected to diamine derivitization by the 
process described below. 

Beads (200 mg) were treated with 1.0 ml of 1 N HCI and vortexed 15 min. The beads were pelleted, decanted, 
and washed with three times (3x) with 1 .0 mL of water each wash and then washed 3x with 1 .0 ml of DMF each wash. 
To the washed pellet was added 2-(1H-benzotriazol-1-yl)-1,1 ,3,3-tetramethyluronium hexafluorophosphate (HBTU; 38 
mg, 0.1 mmole), 1 -hydroxybenzotriazole hydrate (HOBT; 15 mg, 0.1 mmole), 500 uJ of methylene chloride, and 54 \i\ 
of diisopropylethylamine (DIEA; 0.3 mmole). After vortex treatment for 2 min., 20 uJ of diamine (4,9-dioxa-1 ,12-do- 
decanediamine; 94 u,mole) were added. After vortex treatment for 30 min, 1.0 ml of DMF was added, and the beads 
were pelleted by centrifugation. The supernatant was removed, and 1 .0 ml of 1 0% water in DMF was added. The beads 
were vortexed an additional 15 min. and finally washed 3x with 1 .0 ml of DMF each wash. 

B. Oligonucleotide Attachment 

Two different target oligonucleotides were employed in this experiment: a 95 mer and a 110 mer. These oligonu- 
cleotides were composed of the bases cytidine, thymidine, and 7-deaza adenosine. The oligonucleotides were syn- 
thesized with a primary amino group on the 5Merminus (5*-amino modifier C-12, Glen Research). Lyophilized oligonu- 
cleotide (600 pmole) was dissolve in 5 uJ of 0.5 M Na-phosphate, pH 7.7, and the solution was treated with 10 uJ of 
0.2 M disuccinimydylsuberate (DSS). The reaction proceeded 10 min., and then 85 \i\ of ice-cold water were added. 
Un reacted DSS was removed by centrifugation. The supernatant was passed through a G25 spin column that had 
been equilibrated with water. The eluant was immediately frozen and lyophilized to isolate the 5'-N-hydroxysucdnamide 
ester of the oligonucleotide. 

This activated oligonucleotide was dissolved in 50 jil of 0. 1 M Na-phosphate, pH 7.5, which contained 0.1 mg/mL 
of sonicated salmon sperm DNA. This solution was added to 10 mg of diamine derivirized beads. After vortex treatement 
for 3 h (nr.), the beads were washed 2x with 0.4 ml of 0.1 M Na-phosphate, pH 7.5, each wash, and then washed 2x 
with 0.4 ml of 0.1 N NaOH. Finally, the beads were washed with 3x with 0.4 ml of pH 75 buffer. 

C. Peptide Attachment 

To Boc-YGGFL or Boc-Phe (Boc = t-butoxy-carbonyl amine protecting group; 0.1 mmole) was added HBTU (0.1 
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mmote), HOBT (0.1 mmole), 1.0 ml of 10% DMF in methylene chloride, and DIEA (0.3 mmole). After vortex treatment 
to dissolve the solids, 0.4 ml of the peptide solution was added to 3 mg of oligonucleotide-labeled beads. The solution 
containing Boc-YGGFL was added to beads labeled with the 95 mer, and the solution containing Boc-Phe was added 
to beads labeled with the 110 mer. The reaction mixtures were vortexed 30 min. and then diluted with DMF, centrifuged, 
5 decanted, and the bead pellets washed with 3x with 1 .0 ml THE. The Boc protecting groups were removed by treating 
the beads with 0.4 ml of 95% trifluoroacetic acid for 10 min. The deprotection reaction was then diluted with THF, 
centrifuged, and decanted, and the beads were washed with 3x with 1 .0 ml of DMF each wash Finally, the beads were 
washed with 3x with 0.5 ml of 0.1 M Na-phosphate, pH 7.5, each wash and stored as a slurry (10 mg/ml). 

10 D. Mixing, Staining, and Sorting 

The beads coupled with the 95 mer and YGGFL were mixed with the beads that were coupled to the 110 mer and 
F in the ratio of 1:20. Thus, 0.1 mg of 95 mer/YGGFL beads (2 million beads) were mixed with 2.0 mg of the 110 mer/ 
Phe beads (40 million beads). The mixture was suspended in blocking buffer (PBS, 1% BSA, and 0.05% Tween-20) 
'5 and incubated at room temperature for 1 h. The beads were next pelleted by centrifugation and resuspended in a 
solution of an FITC-labeled monoclonal antibody 3E7 that binds the peptide YGGFL (1 ng/ml). The suspension was 
incubated 0.5 h on ice and then centrifuged to isolate the bead pellet. 

The beads were resuspended in PBS for delivery into the fluorescence activated cell sorting (FACS) instrument 
(Becton Dickinson FACSORT Plus). Beads that had bound to the fluorescently labeled antibody were identified by their 
20 acquired fluorescence, and fluorescent beads were isolated by sorting into PCR tubes. One, ten, or one hundred 
fluorescent beads were sorted into each PCR tube. In an analogous manner, non-fluorescent beads were also sorted 
into PCR tubes. 

E. Amplification of Sorted Beads 

25 

To each PCR tube containing a bead or beads was added 25 jiL of PCR buffer (20 mM Tris-HCI, pH 8.7; 10 mM 
KCL; 10 mM (NH4) 2 S0 4 ; 2 mM MgCI 2 ; 0.1% Triton X-100; 0.14 mg/ml BSA; 200 \xm dATP; 200 \\m dGTP; 200 ujn 
dCTP; 200 u/n dTTP; 2 um primer #1 ; 2 jxm primer #2; and 05 units of Pfu DNA polymerase). Reactions were subjected 
to 40 cycles of 95°C for 05 min., 55°C for 1 min., and 72°C for 1 min. 

30 Gel loading dye (2 u,L) was added to 1 0 jil of each PCR, and the sample was run on a 2% low melting point agarase 

gel. DNA fragments were visualized by staining with ethidium bromide and exposure to UV light Five of six of the tubes 
containing single flourescent beads gave rise to DNA fragments 95 base pairs in length, confirming that these beads 
were coupled to YGGFL and not F. Tubes containing 10 or 100 fluorescent beads also gave rise to 95 mer DNA 
fragments. Conversly, none of the tubes containing 1 , 10, or 100 non -fluorescent beads gave rise to 95 mer fragments. 

35 There were, however, anomalous amplification products smaller than 110 bp from amplification of the tags of non- 

fluorescent beads. These anomalous products may have arisen through the use of unprotected oligonucleotide tags 
in this example, which may have allowed the free exocyclic amines to couple to the F amino acid, thereby rendering 
the tag subject to anomalous amplification. This problem would not have affected the 95 mer tag to the same extent, 
because YGGFL would be less reactive with the exocyclic amines than F. 

40 

EXAMPLE 5: LIBRARY SYNTHESIS AND SCREENING 

This example is illustrated schematically in Figure 9. Briefly, a single population of amine derivatized beads (pre- 
pared as described in Example 4) was coupled to glycine. The population was then divided into two equal parts, and 

45 each part was labeled with a characteristic oligonucleotide that would uniquely identify the bead subpopulation. The 
subpopulation that had been labeled with an oligonucleotde 95 bases in length (the 95 mer described in Example 4) 
was subsequently coupled to the peptide YGGFL. The population of beads that had been labeled with an oligonucleotide 
110 bases in length (the 110 mer described in Example 4) was coupled to the peptide FLFLF. (SEQ ID NO:16) The 
beads were then mixed in the ratio of twenty FLFLF/110 mer beads for each YGGFL/95 mer bead (i.e., 20:1) and 

50 stained with a fluorescently labeled antibody (3E7) that binds the peptide sequence YGGFL with high affinity. Individual 
fluorescently stained beads and unstained beads were sorted directly into PCR tubes. Upon PCR, all the fluoresently 
stained beads gave rise to a fragment of amplified DNA 95 base pairs in length, and all the unstained beads gave rise 
to a fragment 110 base pairs in length. 

55 a. Peptide Coupling Step #1 

To Fmoc-Gly (Fmoc = 9-fluorenylmethoxycarbonyl amine protecting group; 0.1 mmole) was added HBTU (0.1 
mmole), HOBT (0.1 mmole), 1 .0 ml of 10% DMF in methylene chloride, and DIEA (0.3 mmole). After vortex treatment 
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to dissolve the solids, 0.4 mL of the solution containing the activated amino acid was added to 50 mg of diamine 
derivatized beads. The reaction mixture was vortexed 30 min. and then diluted with DMF, centrifuged, decanted, and 
the bead pellet washed twice with 1.0 ml of DMR The coupling reaction was then repeated. The beads were then 
treated with 1 .0 ml of 30% piperidine in DMF with vortexing for 1 h to deprotect the glycine amino group. 

B. Oligonucleotide Labeling 

Two different target oligonucleotides were employed in this experiment: the 95 mer and 110 mer described in 
Example 4. Half of the bead sample described above (25 mg) was labeled with the 95 mer, and the other half was 
labeled with the 1 1 0 mer. These oligonucleotides are composed of 2'-deoxy-cytidine, thymidine, and 2'-deoxy-7-deaza- 
adenosine. The oligonucleotides were synthesized with a primary amino group on the 5'-terminus (MMT-C12-Ami- 
nomodifier, Clonetech Laboratories, Inc.). Lyophilized oligonucleotide (1.5 nmole) was dissolved in 10 uJ of 0.5 M Na- 
phosphate, pH 7.7, and the solution was then treated with 20 jil of 0.2 M disuccinimydylsuberate (DSS). The reaction 
proceeded 1 0 min., and then, 70 uJ of ice-cold water were added. Unreacted DSS was removed by centrifugation. The 
supernatant was passed through a G-25 spin column that had been equilibrated with water. The eluant was immediately 
frozen and lyophilized to isolate the S'-N-hydroxysuccinamide ester of the oligonucleotide. This activated oligonucle- 
otide was dissolved in 100 jil of 0.1 M Na-phosphate, pH 7.5, which contained 0.1 mg/ml of sonicated salmon sperm 
DNA. This solution was added to 25 mg of glycine-coupled beads. After vortex treatment for 3 h, the beads were 
washed twice with 0.4 ml of 0.1 M Na-phosphate, pH 7.5, and twice with 0.4 ml of 0.1 N NaOH. Finally, the beads were 
washed three times with 0.4 ml of pH 7.5 buffer. 

C. Peptide Coupling Step #2 

To Boc-YGGFL or Boc-FLFLF (Boc = t-butoxy-carbonyl amine protecting group; 0.02 mmole) was added HBTU 
(0.02 mmole), HOBT (0.02 mmole), 0.190 ml of 10% DMF in methylene chloride, and DIEA (0.06 mmole). After vor- 
textreatment to dissolve the solids, the solution was diluted ten-fold in 10% DMF in methylene chloride. An aliquot of 
this solution (0.345 ml) was added to the glycine-coupled and oligonucleotide-labeled beads (25 mg). The solution 
containing Boc-YGGFL was added to beads labeled with the 95 mer, and the solution containing Boc-FLFLF was added 
to beads labeled with the 11 0 mer. The reaction mixtures were vortexed 30 min. and then diluted with DMF, centrifuged, 
decanted, and the bead pellets washed three times with 1 .0 ml of THF. The Boc protecting groups were removed by 
treating the beads with 0.4 mL of 95% trifluoroacetic acid for 10 min. The deprotection reaction was then diluted with 
THF, centrifuged, decanted, and the beads washed three times with 1.0 ml of DMF. Finally, the beads were washed 
three times with 0.5 ml of 0.1 M Na-phosphate, pH 7.5, and stored as a slurry (10 mg/ml). 

D. Mixing, Staining, and Sorting 

The beads coupled to the 110 mer and FLFLF were mixed with the beads that were coupled to the 95 mer and 
YGGFL in the ratio of 20:1 . Thus, 0.1 mg of 95 mer/YGGFL beads (2 million beads) were mixed with 2.0 mg of the 110 
mer/FLFLF beads (40 million beads). The mixture was suspended in blocking buffer (PBS, 1% BSA, 0.05% Tween- 
20) and incubated at room temperature for 1 h. The beads were next pelleted by centrifugation and resuspended in a 
solution of an FITC-labeled monoclonal antibody (3E7) that recognizes the peptide sequence YGGFL (1 |ig/ml). The 
suspension was incubated 0.5 h on ice and then centrifuged to isolate the bead pellet. 

The beads were resuspended in PBS for delivery into the fluorescence activated cell sorting (FACS) instrument 
(Becton Dickinson FACSORT Plus). Beads that had bound to the fluorescently labeled antibody were identified by their 
acquired fluorescence (see Figure 10), and homogeneous samples of either fluorescent or non -fluorescent beads were 
isolated by sorting into PCR tubes. One, ten, or one hundred beads of each type were sorted into each PCR tube. 

E. PCR of Sorted Beads 

To each PCR tube containing a bead, or beads, was added 25 uL of PCR mix (20 mM Tris-HCI, pH 8.7, 10 mM 
KCL, 10 mM (NH 4 ) 2 S0 4l 2 mM MgCI 2 , 0.1% Triton X-100, 0.14 mg/ml BSA, 200 u.M dATP, 200 jxM dGTR 200 uM 
dCTR 200 u.M dTTP, 2 u.M of each primer (as described in Example 4), and 0.5 units of Pfu DNA polymerase). Reactions 
were subjected to 40 ycles of 95°C for 0.5 min., 55°C for 1 min., and 72°C for 1 min. Gel loading dye (2 u.1) was added 
to 10 ul of each PCR and the sample run on a 2% low melting point agarose gel. DNA fragments were visualized by 
staining with ethidium bromide and exposure to UV light. Six single bead samples, three 1 0 bead samples, and three 
100 bead samples were amplified from both the fluorescent and non-fluorescent populations. AM the bead samples 
from the flourescent population produced only DNA fragments 95 base pairs in length, and all the samples from the 
non-fluorescent population produced only fragments 110 base pairs in length (see Figure 11). 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

5 (j) APPLICANT: 

DOWER, WILLIAM J 
BARRETT, RONALD W 
GALLOP, MARK A 
10 NEEDELS, MICHAEL C 

(ii) TITLE OF INVENTION: METHOD OF SYNTHESIZING DIVERSE COLLECTIONS OF OLIGOMERS 

(iii) NUMBER OF SEQUENCES: 16 

15 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: TOWNSEND AND TOWNSEND 

(B) STREET: 1 MARKET PLAZA, STEUART TOWER, SUITE 2000 
20 (C) CITY: SAN FRANCISCO 

(D) STATE: CALIFORNIA 

(E) COUNTRY: USA 

(F) ZIP: 94105 

25 ( V ) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

30 (D) SOFTWARE: Patentln Release #1 .0, Version #1 .25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 
35 (B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

40 (A) NAME: Smith, William M. 

(B) REGISTRATION NUMBER: 30,223 

(C) REFERENCE/DOCKET NUMBER: 11509-36-1 

(fx) TELECOMMUNICATION INFORMATION: 

45 

(A) TELEPHONE: 415-543-9600 

(B) TELEFAX: 415-543-5043 

(2) INFORMATION FOR SEQ ID NO:1: 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 base pairs 

(B) TYPE: nucleic acid 

ss (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 



CTTTCTTCCT CTCCCTCTTT TCTCCTCTCT TTTTTTCTCC TTCTTTTTTT CTCTCCCTCT 60 
CTCCTCTCTC CCCTTTCTCT CCTTTCCTCC TCTCCTCTCT CTCTTCTTTC C 111 



(2) INFORMATION FOR SEQ ID NO:2: 
10 (j) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

20 

CTTTCTTCCT CTCCCTCTTT TCTCCTCTTC TTTTTTCTCC TTTCTTTTTT CTCTCCCTCT 60 
CTCCTCTCTC CCCTTTCTCT CCTTTCCTCC TCTCCTCTCT CTCTTCTTTC C 111 

25 

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

30 

(A) LENGTH: 115 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

40 

CTTTCTTCCT CTCCCTCTTT TCTCCTCTTT CTTTTTCTCC TTTTCTTTTT CTCTCCCTCT 60 

CTCCTCTCTC TCTTCCTTTC CCCTCTCTCT CTCCTCTCCT CTCTCT CTTC TTTCC 115 



45 (2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

55 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 



26 



EP 0 604 552 B1 

CTTTCTTCCT CTCCCTCTTT TCTCCTCTTC TTTTTTCTCC TTTCTTTTTT CTCTCCCTCT 
CTCCTCTCTC TCTTCCTTTC CCCTCTCTCT CTCCTCTCCT CTCTCTCTTC TTTCC 

(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

GGAAAGAAGA GAGAGAGGAG AGG 

(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

AGAGAGGGGA AAGGAAGA 

(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

AGGAAAGGAG AGAAAGGG 

(2) INFORMATION FOR SEQ ID NO:8: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 95 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

CCACTCACTA CCACTCTACT ATAACCACCC CTTCCTATTC CAAAATTACA AACTTATCT C 
AACTACATCT CACACTCACT CATCTCTACA TCTAC 

(2) INFORMATION FOR SEQ ID NO:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 110 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

CCACTCACTA CCACTCTACT ATAACCCTCC CCTATTCCAA AATTACATCC TATTCCAAAA 
TTACAAACTT AT CTCAACT A CATCTCACAC TCACT CATCT CTACATCTAC 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

Gly Gly Phe Leu 
1 

(2) INFORMATION FOR SEQ ID NO:11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11: 

5 

Pro Gly Phe Leu 

1 



(2) INFORMATION FOR SEQ ID NO:12: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

20 

Tyr Gly Gly Phe Leu 
1 5 

25 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

30 (A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 



Tyr Pro Gly Phe Leu 

40 15 



(2) INFORMATION FOR SEQ ID NO: 14: 
45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

50 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 

55 

Pro Gly Gly Phe Leu 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 



Pro Pro Gly Phe Leu 
1 5 



(2) INFORMATION FOR SEQ ID NO:16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 



Phe Leu Phe Leu Phe 

1 5 



Claims 

1. The use of identifier tags to enable subsequent identification of reactions through which members of a library of 
different synthetic compounds have been synthesised in a component by component fashion and consequent 
deductive structural identification of said members. 

2. A library of different synthetic compounds, which compounds are obtainable by synthesis in a component by com- 
ponent fashion which links each compound to one or more identifier tags which enable subsequent identification 
of reactions through which said components were incorporated and consequent deductive structural identification 
of said members. 

3. A library of claim 2 comprising a plurality of different members, each member comprising an oligomer composed 
of a sequence of monomers linked to one or more identifier tags identifying the sequence of monomers in said 
oligomer. 

4. A library of claim 3, wherein said linkage between said oligomer and said identifier tag comprises a solid support. 

5. A library of claim 3 or claim 4, wherein said identifier tag is attached to said oligomer. 

6. A library of any one of claims 3 to 5 that has about 10 6 different members. 

7. A library of any one of claims 3 to 6, wherein said oligomer is a peptide or an oligonucleotide. 
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8. A library of any one of claims 3 to 7 wherein said identifier tag is a fluorescent marker. 

9. A tagged synthetic oligomer library produced by synthesizing on each of a plurality of solid supports a single 
oligomer sequence and one or more identifier tags identifying said oligomer sequence, said oligomer sequence 

5 and identifier tags synthesized in a process comprising the steps of: 

(a) apportioning said supports among a plurality of reaction vessels; 

(b) exposing said supports in each reaction vessel to a first oligomer monomer and to a first identifier tag; 

(c) pooling said supports; 

10 (d) apportioning said supports among a plurality of reaction vessels; 

(e) exposing said supports to a second oligomer monomer and to a second identifier tag; and 

(f ) repeating steps (a) through (e) from at least one to twenty times. 

10. A method of preparing a tagged synthetic oligomer library comprising a plurality of different members, each member 
15 comprising a solid support having attached a single oligomer sequence and one or more identifier tags identifying 

said oligomer sequence, said method comprising the steps of: 

(a) apportioning said supports among a plurality of reaction vessels; 

(b) reacting said supports in each reaction vessel with a first oligomer monomer; 
20 ( C ) reacting said supports in each reaction vessel with a first identifier tag; 

(d) pooling said supports; 

(e) apportioning said pooled supports among a plurality of reaction vessels; 

(f) reacting said pooled supports in each reaction vessel with a second oligomer monomer; 

(g) reacting said pooled supports in each reaction vessel with a second identifier tag; and 
25 (h) repeating steps (a) through (g) from at least one to twenty times. 

11. A solid support comprising a first particle attached to a second particle, said first particle linked to an oligomer and 
said second particle linked to an oligonucleotide identifier tag, and wherein said oligomer is other than an oligo- 
nucleotide. 

30 

12. A method of recording each step in a sequence of oligomer monomer additions in the synthesis of an oligomer 
library, the method comprising adding an identifier tag in conjunction with the addition of each monomer, and 
performing at least two cycles of monomer and tag addition, thereby forming a series of identifier tags identifying 
said oligomer sequence. 

35 

13. A method of claim 1 2, wherein after the addition of a first identifier tag, subsequent identifier tags are attached to 
the terminus of preexisting tag. 

14. A method of claim 12 or claim 13 and for synthesizing a peptide and an oligonucleotide on a solid support, said 
40 method comprising: 

(a) preparing a bifunctional solid support containing a first type of active site blocked with a first type of pro- 
tecting group and a second type of active site blocked with a second type of protecting group; 

(b) reacting said solid support with an activator to remove said first type of protecting group thereby exposing 
45 said first type of active site; 

(c) coupling an oligonucleotide monomer or an oligonucleotide to said first type of active site; 

(d) reacting said solid support with an activator to remove said second type of protecting group thereby ex- 
posing said second type of active site; 

(e) coupling a peptide monomer or peptide to said second type of active site; and 
50 (f ) repeating steps (b) through (e) from one to twenty times. 

15. An oligomer library which is obtainable by a process comprising a method of any one of claims 12 to 14. 



55 PatentansprGche 

1 . Die Verwendung von Identifizierungs-Tags, um eine nachfolgende Identifizierung von Reaktionen zu ermoglichen, 
durch die Mitglieder einer Bibliothek von verschiedenen synthetischen Verbindungen in einer Baustein-bei-Bau- 
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stein-Art synthetisiert worden sind, und darauf folgende deduktive strukturelle Identifizierung der Mitglieder. 

2. Eine Bibliothek von verschiedenen synthetischen Verbindungen, wobei die Verbindungen erhaltlich sind durch 
Synthese in einer Baustein-bei-Baustein-Art, die jede Verbindung zu einem Oder mehreren Identifizierungs-Tags 

s verknupft, die die nachfolgende Identifizierung von Reaktionen ermoglicht, durch die die verbindungen eingeglie- 

dert wurden, und darauf folgende deduktive strukturelle Identifizierung der Mitglieder. 

3. Bibliothek nach Anspruch 2, umfassend eine Vielzahl von verschiedenen Mitgliedern, wobei jedes Mitglied ein 
Oligomer umfaGt, zusammengesetzt aus einer Sequenz von Monomeren verbunden mit einem Oder mehreren 

10 Identifizierungs-Tags, die die Sequenz der Monomere in dem Oligomer identifizieren. 

4. Bibliothek nach Anspruch 3, worin die Bindung zwischen dem Oligomer und dem I dent ifizie rungs-Tag einen festen 
Trager umfaGt. 

'5 5. Bibliothek nach Anspruch 3 Oder 4, worin das Identifizierungs-Tag an dem Oligomer angeheftet ist. 

6. Bibliothek nach einem der Anspruche 3 bis 5, die etwa 10 6 verschiedene Mitglieder hat. 

7. Bibliothek nach einem der Anspruche 3 bis 6, worin das Oligomer ein Peptid oder ein Oligonucleotid ist. 

20 

8. Bibliothek nach einem der Anspruche 3 bis 7, worin das Identifizierungs-Tag ein Fluoreszenzmarker ist. 

9. Eine synthetische Oligomer-Bibliothek mit Markierung, hergestellt durch Synthetisieren auf jeder von einer Vielzahl 
von festen Tragern eine einzelne Oligomersequenz und/oder mehrere Identifizierungs-Tags, die die Oligomerse- 

2S quenz identifizieren, wobei die Oligomersequenz und die Identifizierungs-Tags synthetisiert werden in einem Ver- 

fahren, umfassend die folgenden Schritte: 

(a) Verteilung der Trager unter einer Vielzahl von ReaktionsgefaGen, 

(b) Aussetzen der Trager in jedem ReaktionsgefaG einem ersten Oligomer-Monomer und einem ersten Iden- 
tifizierungs-Tag, 

(c) Zusammenlegen der Trager, 

(d) Verteilung der Trager unter einer Vielzahl von ReaktionsgefaGen, 

(e) Aussetzen der Trager einem zweiten Oligomer-Monomer und einem zweiten Identifizierungs-Tag-Mono- 
mer, und 

(f) Wiederholung der Schritte (a) bis (e) mindestens ein- bis zwanzigmal. 

10. Verfahren zur Herstellung einer synthetischen Oligomer-Bibliothek mit Markierung, umfassend eine Vielzahl von 
verschiedenen Mitgliedern, wobei jedes Mitglied einen festen Trager umfaGt, an den eine einzelne Oligomerse- 
quenz angeheftet ist und eine Oder mehrere Identifizierungs-Tags, die die Oligomersequenz identifizieren, wobei 

40 das Verfahren die folgenden Schritte umfaGt: 

(a) Verteilung der Trager unter einer Vielzahl von ReaktionsgefaGen, 

(b) Umsetzen der Trager in jedem ReaktionsgefaG mit einem ersten Oligomer-Monomer, 

(c) Umsetzen der Trager in jedem ReaktionsgefaG mit einem ersten Identifizierungs-Tag, 

(d) Zusammenlegen der Trager, 

(e) Verteilung der zusammengelegten Trager auf eine Vielzahl von ReaktionsgefaGen, 

(f ) Umsetzen der zusammengelegten Trager in jedem ReaktionsgefaG mit einem zweiten Oligomer-Monomer, 

(g) Umsetzen der zusammengelegten Trager in jedem ReaktionsgefaG mit einem zweiten Identifizierungs- 
Tag, und 

(h) Wiederholen der Schritte (a) bis (g) mindestens ein- bis zwanzigmal. 

11. Ein fester Trager, umfassend ein erstes Teilchen, angeheftet an ein zweites Teilchen, wobei das erste Teilchen an 
ein Oligomer und das zweite Teilchen an einen Oligonucleotid-ldentifizierungs-Tag gebunden ist und worin das 
Oligomer ein anderes ist als ein Oligonucleotid. 

55 

12. Verfahren zum Aufzeichnen jedes Schritts in einer Sequenz von Oligomer-Monomer- Additionen in der Synthese 
einer Oligomer-Bibliothek, wobei das Verfahren umfaGt das HinzufOgen eines Identifizierungs-Tags in Verbindung 
mit dem HinzufOgen jedes Monomers und Durchfuhrung von mindestens zwei Zyklen der Monomer- und Tag- 
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Addition unter Bildung einer Serie von ! dent ifizie rungs-Tags, die die Oligomersequenz identifizieren. 

13. Verfahren nach Anspruch 12, worin nach dem Hinzufugen eines ersten identifizierungs-Tags nachfolgend Identi- 
fizierungs-Tags an dem Ende des zuvor existierenden Tags angeheftet werden. 

5 

14. Verfahren nach Anspruch 12 oder 13 und zur Synthetisierung eines Peptids und eines Oligonucleotids auf einem 
festen Trager, wobei das Verfahren umfaBt: 

(a) Herstellung eines bifunktionalen festen Tragers, enthaltend einen ersten Typ einer aktiven Seite, blockiert 
10 mit einem ersten Typ einer Schutzgruppe und einen zweiten Typ einer aktiven Seite, blockiert mit einem zwei- 

ten Typ einer Schutzgruppe, 

(b) Umsetzen des festen Tragers mit einem Aktivator, um den ersten Typ der Schutzgruppe zu entfernen und 
dadurch Exponieren des ersten Typs der aktiven Seite, 

(c) Kupplung eines Oligonucteotid-Monomers oder eines Oligonucleotids an den ersten Typ einer aktiven Sei- 
15 te, 

(d) Umsetzen des festen Tragers mit einem Aktivator, um den zweiten Typ der Schutzgruppe zu entfernen 
und dadurch Exponieren zweiten Typs der aktiven Seite, 

(e) Kupplung eines Peptid-Monomers oder eines Peptids an den zweiten Typ der aktiven Seite, und 

(f) Wiederholung der Schritte (b) bis (e) ein- bis zwanzigmal. 

20 

15. Eine Oligomer-Bibliothek, erhattlich durch ein Verfahren umfassend ein Verfahren nach einem der AnsprOche 12 
bis 14. 



25 Revendicatlons 

1. Utilisation de marqueurs d' identification pour permettre Identification ulterieure de reactions par lesquelles des 
membres d'une collection de composes synthetiques differents ont ete synthetises d'une maniere operant com- 
posant apres composant et Identification structural deductive resultante desdits membres. 

30 

2. Collection de composes synthetiques differents, lesquels composes peuvent etre obtenus par synthese d'une 
maniere operant composant apres composant qui lie chaque compos6 a un ou plusieurs marqueurs d'identification 
qui permettent Identification ulterieure des reactions par lesquelles lesdits composants ont ete incorpores et Iden- 
tification structurate deductive resultante desdits membres. 

35 

3. Collection de la revendication 2, comprenant plusieurs membres differents, chaque membre comprenant un oli- 
gomere constitue d'une sequence de monomeres Ii6e a un ou plusieurs marqueurs d'identification identifiant la 
sequence des monomeres dans ledit oligomere. 

40 4. Collection de la revendication 3, dans laquelle ledit lien entre ledit oligomere et ledit marqueur d'identification 
comprend un support solide. 

5. Collection de la revendication 3 ou la revendication 4, dans laquelle ledit marqueur d'identification est attache audit 
oligomere. 

45 

6. Collection de Tune quelconque des revendications 3 a 5, qui compte environ 1 0 6 membres differents. 

7. Collection de Tune quelconque des revendications 3 a 6, dans laquelle ledit oligomere est un peptide ou un oligo- 
nucleotide. 



50 



8. Collection de Tune quelconque des revendications 3 a 7, dans laquelle ledit marqueur d'identification est un mar- 
queur fluorescent. 

9. Collection d'oligomeres synthetiques marques produite par synthese sur chacun de plusieurs supports solides 
d'une sequence d'oligomere unique et d'un ou plusieurs marqueurs d'identification identifiant ladite sequence d'oli- 
gomere, ladite sequence d'oligomere et les marqueurs d' identification etant synthetises dans un precede compre- 
nant les etapes suivantes : 
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(a) repartir lesdits supports parmi plusieurs recipients reactionnels ; 

(b) exposer lesdits supports dans chaque recipient reactionnel a un premier monomere d'oligomere et a un 
premier marqueur d'identification ; 

(c) rassembler lesdits supports ; 

5 (d) repartir lesdits supports parmi plusieurs recipients reactionnels ; 

(e) exposer lesdits supports a un second monomere d'oligomere et a un second monomere de marqueur 
d'identification ; et 

(f) repeter les etapes (a) a (e) d'au moins une a vingt fois. 

io 10. Methode de preparation d'une collection d'oligomeres synthetiques marqu6s comprenant plusieurs membres dif- 
ferents, chaque membre comprenant un support solide auquel est attachee une sequence d'oligomere unique et 
un ou plusieurs marqueurs d'identification identifiant ladite sequence d'oligomere, ladite methode comprenant les 
etapes suivantes : 

15 (a) repartir lesdits supports parmi plusieurs recipients reactionnels ; 

(b) faire reagir lesdits supports contenus dans chaque recipient reactionnel avec un premier monomere 
d'oligomere ; 

(c) faire reagir lesdits supports contenus dans chaque recipient reactionnel avec un premier marqueur 
d'identification ; 

20 (d) rassembler lesdits supports ; 

(e) repartir lesdits supports rassembles parmi plusieurs recipients reactionnels ; 

(f) faire reagir lesdits supports rassembles contenus dans chaque recipient reactionnel avec un second mo- 
nomere d'oligomere ; 

(g) faire reagir lesdits supports rassembles contenus dans chaque recipient reactionnel avec un second mar- 
25 queur d'identification ; et 

(h) repeter les etapes (a) a (g) d'au moins une a vingt fois. 

11. Support solide comprenant une premiere particule attachee a une seconde particule, ladite premiere particule 
etant liee a un oligomere et ladite seconde particule etant liee a un marqueur d'identification oligonucleotidique, 

30 et ledit oligomere etant autre chose qu'un oligonucleotide. 

12. Methode d'enregistrement de chaque elape d'une sequence d'additions de monomeres d'oligomere dans la syn- 
thase d'une collection d'oligomeres, la methode comprenant I'addition d'un marqueur d'identification conjointement 
a I'addition de chaque monomere, et I'execution d'au moins deux cycles d'addition de monomere et de marqueur, 

35 pour former ainsi une serie de marqueurs d'identification identifiant ladite sequence d'oligomere. 

13. Methode selon la revendication 12, dans laquelle, apres I'addition d'un premier marqueur d'identification, les mar- 
queurs d'identification suivants sont attaches a l'extremit6 du marqueur preexistant. 

40 14. Methode selon la revendication 12 ou la revendication 13 et pour synthetiser un peptide et un oligonucleotide sur 
un support solide, ladite methode comprenant : 

(a) la preparation d'un support solide bifonctionnel contenant un premier type de site actif bloque par un premier 
type de groupe protecteur et un second type de site actif bloque par un second type de groupe protecteur; 
45 (b) la reaction dudit support solide avec un activateur pour eliminer ledit premier type groupe protecteur en 

exposant ainsi ledit premier type de site actif ; 

(c) le couplage d'un monomere d'oligonucleotide ou d'un oligonucleotide audit premier type de site actif ; 

(d) la reaction dudit support solide avec un activateur pour eliminer ledit second type de groupe protecteur en 
exposant ainsi ledit second type de site actif ; 

50 (e) le couplage d'un monomere de peptide ou d'un peptide audit second type de site actif ; et 

(f) la repetition des etapes (b) a (e) d'une a vingt fois. 

15. Une collection d'oligomeres qui peut etre obtenue par un procede comportant une methode de Tune quelconque 
des revendications 12 a 14. 

55 
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O-DMT 



r/a 7. 
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LINKER 



II) DEPROTECT HYDROXYLS 
(2) BUILD 3* PGR SITE 



0-OMT 



Fmoc-N 1 3' SITE Y 0- DMT 



(1) DEPROTECT AMINES 

(2) COUPLE AA| 

13) DEPROTECT HYDROXYLS 
(4) BUILD •CODONi" 



Fmoc-AA,- 1 S SITE )C0D7> Q-DMT 



I 



REPEAT PEPTIDE AND NUCLEOTIDE COUPLINGS 

(I) DEPROTECT HYDROXYLS 

12) BUILD 5* PCR SITE 

(3) OEPROTECT OLIGO/PEPTIDE 



H 2 N-AA n -AA 2 AA| Q 1 3* SITE ICODiTCODTVHCODnl 5' SITE | -0H 



FIG. 8. 
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